Probes, methods and kits for detection and typing of Helicobacter pylori nucleic acids in biological samples

ABSTRACT

The present invention relates to a method for the detection and/or typing of  Helicobacter pylori  ( H. pylori  ) strains present in a sample including the steps of (i) amplifying the polynucleic acids of target regions of the vacA gene and the cagA gene, with suitable primer pairs, the primers being generally applicable on different  H. pylori  strains, where the target regions include a conserved region in the case of the cagA alleles and a variable region in the case of the vacA alleles; (ii) hybridizing the polynucleic acids obtained with a set of at least two VDG (virulence determinant gene)-derived probes, and with at least one of the probes hybridizing to a conserved region of a cagA of  H. pylori,  and with at least one of the probes hybridizing to a variable region of vacA; (iii) detecting the hybrids formed; and (iv) detecting and/or typing  H. pylori  strains present in a sample from the differential hybridization signals obtained. The present invention also relates to probes and primers for doing the same as well as  Helicobacter pylori  detecting/typing kits.

[0001] This invention relates to the field of the detection and typing of the human pathogen Helicobacter pylori, abbreviated as H.pylori below.

[0002] This invention relates to probes, primers, methods, and kits comprising the same for the detection and typing of nucleic acids of H.pylori in biological samples. H.pylori is the causative agent of chronic superficial gastritis in humans, and infection with this organism is a significant risk factor for the development of peptic ulcer disease and gastric cancer. (Blaser et al., 1992; Hentschel et al., 1993; Parsonnet et al., 1991)

[0003] The outcome of an infection with H.pylori is rather diverse, probably reflecting the large diversity within the species at the genetic level (Foxall et al., 1992; Akopyanz et al., 1992). However, most phenotypic characteristics are well conserved. As individuals can be infected with various strains, it will however be important to identify particular characteristics of different H.pylori strains that precisely determine risk among these strains.

[0004] Among the respective virulence determinants of H.pylori, two important genetic elements have been identified recently: the vacuolating toxin gene (vacA gene) and the cytotoxin associated gene (cagA gene) (Leunk et al. 1988: Cover and Blaser, 1992, 1995; Cover et al. 1992, 1994, Tummuru et al., 1993; Covacci et al., 1993).

[0005] The H.pylori vacuolating toxin induces cytoplasmic vacuolation in a large number of mammalian cell lines in vitro (Leunk et al. 1988), and produces epithelial cell damage and mucosal ulceration when administrated intragastrically to mice (Telford et al., 1993). The vacA gene encodes a 1287-1296 amino acid precursor which is processed (N- and C-terminally) to a 87-Kda secreted protein (Cover and Blaser, 1992; Cover et al., 1994; Telford et al., 1994; Schmitt and Haas, 1994; Phadnis et al., 1994). Although only 50% of the H.pylori strains induce vacuolation, nearly all strains hybridize to vacA probes (Cover et al., 1994; Telford et al. 1994; Schmitt and Haas, 1994; Phadnis et al., 1994). Very recently, Athenton et al., (1995) gave evidence for a mosaic organisation of the vacA gene, which indicated that specific vacA genotypes of H.pylori strains are associated with the level of cytotoxin activity in vitro as well as with the clinical consequences.

[0006] It was shown that three different classes of vacA signal sequences (s1a, s1b and s2) are present and two different classes of middle-region alleles (m1 and m2). All possible combinations of these vacA regions have been isolated with the exception of s2/ml. The production of cytotoxin activity was strongly linked to the presence of vacA alleles containing the s1-type signal peptide. None of the strains containing s2-type vacA alleles produced detectable cytotoxn activity. Also, a significant correlation between the occurrence of peptic ulceration and the presence of s1-type vacA alleles could be demonstrated.

[0007] A second putative virulence determinant is the high molecular weight protein encoded by the cytotoxin-associated gene, cagA (Tummuru et al., 1993; Covacci et al., 1993). About 60% of the H.pylori strains possess the cagA gene and nearly all of them express the cagA gene product. Production of the vacuolating cytotoxin in vitro and the presence of cagA are closely associated characteristics, although both genes are not tightly genetically linked (Tummuru et al., 1993; Covacci et al., 1993).

[0008] Based on immunoblot studies, it has been demonstrated that persons infected with cagA (+)-strains have higher degrees of gastric inflammation and epithelial cell damage in comparison to infections with cagA(−)-strains. Also, an inhanced expression of a number of cytolines has been found with respect to infection with cagA(+)-strains in comparison to cagA(−)-strains (Huang et al., 1995). As both the intensity of the inflammation and the degree of epithelial damage may be determining the pathogenesis of gastric cancer, the examination of the presence or abscence of the cagA gene upon H.pylori infection is important.

[0009] In this invention, it is disclosed for the first time that the methods described by Atherton et al., 1995 are not suitable to type H.pylori strains present in a number of clinical samples obtained from patients of the Netherlands and Portugal (see example 1). Moreover, the typing method described by these authors involves the resolution of gene-amplification products by agarose gel electrophoresis, a tedious and not highly reliable technique when applied on large number of samples.

[0010] Thus, with respect to the nessecity to evaluate large populations to provide statistically relevant data concerning the linkage between a type of H.pylori strains and any pathogenic phenotype and in view of the need for a rapid, simple and highly reliable typing method in order to determine the applicable eradication strategy at the clinical stage, the above method described by Atherton et al., 1995 is less appropriate.

[0011] It is an aim of this present invention to provide a rapid, sensitive and reliable method to detect and type H. pylori strains in biological samples.

[0012] More particularly, it is an aim of the present invention to provide a rapid, sensitive and reliable method to detect and/or type H.pylori strains in biological samples, associated with the development of chronic active gastritis and/or gastric and duodenal ulcers, and/or gastic adenocarcinomas and/or mucosa-associated lymphoid tissue lymphomas, and/or to determine the applicable eradication therapy.

[0013] It is an aim of the present invention to provide a rapid, sensitive and reliable method to detect and type H.pylori strains present in biological samples, directly coupled to the detection and/or the typing of the alleles of the virulence determinant genes present, including at least the vacA gene.

[0014] More particularly, it is an aim of the present invention to provide a rapid, sensitive and reliable method to detect and type H. pylori strains present in a biological sample, directly coupled to the detection and/or the typing of the vacA and cagA alleles present.

[0015] It is the aim of the present invention to define suitable probes enabling the detection and/or allele-specific typing of H.pylori strains based on the alleles of the virulence determinant genes present, including at least one probe derived from vacA.

[0016] More particularly, it is an aim of the present invention to define suitable probes enabling the detection and/or allele-specific typing of H.pylori strains based on the alleles of the vacA and cagA virulence determinant genes present.

[0017] It is moreover an aim of the present invention to combine the suitable probes enabling detection and/or allele-specific typing of H.pylori strains based on the alleles of the virulence determinant genes present, including at least the vacA gene, whereby all said probes can preferentially be used simultaneously in a multiparameter type of assay, more particularly under the same hybridisation and wash-conditions.

[0018] More particularly, it is an aim of the present invention to combine the suitable probes enabling detection and/or allele-specific typing of H.pylori strains based on the alleles of the vacA and cagA genes present, whereby all probes can be preferentially used simultanously under the same hybridisation and wash-conditions.

[0019] More particularly, it is an aim of this invention to develop suitable probes of relevant target regions of the VDG, including at least the vacA gene, said target regions comprising either a variable region, either a conserved region of the VDG, said probes being applicable, if appropriate, in a simultanous hybridisation assay.

[0020] Even more particularly, it is an aim of this invention to develop suitable probes of relevant target regions of the vacA and cagA genes, said target regions comprising a variable region in case of the vacA gene and a conserved region in case of the cagA gene, said probes being applicable, if appropriate, in a simultanous hybridisation assay.

[0021] Most particularly, it is an aim of this invention to design suitable probes comprising the highly variable S- and M-regions in the vacA gene, said S-region being comprised between the nucleotides at position 1 and 300, and said M-regions being comprised between the nucleotides at the position 1450 and 1650, and a common probe in the case of the cagA gene comprising preferentially the highly conserved region between the nucleotide at the position 17 and the nucleotide at the position 113 of the cagA gene of H.pylori, if appropriate, in a simultanous hybridisation assay.

[0022] It is also an aim of the present invention to select primers enabling the amplification of relevant target regions of alleles of the virulence determinant gene of interest of H.pylori including at least the vacA gene, said amplification being universal for the respective target regions, said target regions comprising either a variable region, or a conserved region of the VDG.

[0023] It is more particularly an aim of the present invention to select primers enabling the amplification of the relevant target regions of the alleles of the vacA and cagA virulence determinant genes of the H.pylori, said primers being being generally applicable with H.pylori strains and allowing the amplification of said relevant target regions to be used in compatible amplification conditions said amplification being universal for the respective vacA and cagA alleles present.

[0024] Most particularly, it is an aim of the present invention to select primers enabling the amplification of the highly variable S- and M-regions in the vacA gene, said S-region being comprised between the nucleotide at position 1 and 300, said M-region being comprised between the nucleotides at the position 1450 and 1650, and the highly conserved region between the nucleotide at the position 1 and the nucleotide at the position 250 of the open reading frame of the cagA gene of H.pylori, by preference in a single amplification reaction It is also an aim of the present invention to provide kits for the detection and/or typing of H.pylori strains.

[0025] More particularly, it is an aim of this invention to provide a kit for the detection and/or typing of H.pylori strains directly coupled to the detection and/or the typing of the alleles of the virulence determinant genes present, including at least the vacA gene.

[0026] Even more particularly, it is an aim of this invention to provide a kit for the detection and/or typing of H.pylori strains based on the detection and/or typing of the alleles of the vacA and cagA genes present.

[0027] Most preferentially, it is an aim of this invention to provide a kit for the detection and/or typing of H.pylori strains based on the detection and/or typing of the highly variable S- and M-regions in the vacA gene and,the highly conserved region between the nucleotide at the position 1 and the nucleotide at the position 250 of the cagA gene of H.pylori.

[0028] All the aims of the present invention have been met by the following specific embodiments. The selection of the probes (except for probes with SEQ ID NO 35 to 39) according to the present invention is based on the Line Probe Assay (LiPA) principle, as exemplified in the Examples section. The LiPA is a reverse hybridization assay using oligonucleotide probes immobilized as parallel lines on a solid support strip (Stuyver et al. 1993; international application WO 94/12670). This approach is particularly advantageous since it is fast and simple to perform. The reverse hybridization format and more particularly the LiPA approach has many practical advantages as compared to other DNA techniques or hybridization formats, especially when the use of a combination of probes is preferable or unavoidable to obtain the relevant information sought. As such, the LiPA is a particularly appropriate method to detect and or type (micro)-organisms in general and H.pylori in particular. The probes with SEQ ID NO 35 to 39 are designed for use in a DNA Enzyme Immuno Assay, as shown in example 8. This assay is particularly convenient for a rapid detection method.

[0029] It is to be understood, however, that any other type of hybridization assay or hybridization format using any of the selected probes as described further in the invention, is also covered by the present invention.

[0030] The reverse hybridization approach implies that the probes are immobilized to a solid support and that the target DNA is labelled in order to enable the detection of the hybrids formed.

[0031] The following definitions serve to illustrate the terms and expressions used in the present invention.

[0032] The target material in the samples envisaged in the present invention may either be DNA or RNA e.g. genomic DNA or messenger RNA or amplified versions thereof. These molecules are also termed polynucleic acids.

[0033] The relevant target regions will in principle be all polynucleic acid sequences comprising a virulence determinant gene, said virulence determinant gene being the genetic element involved in enabling, determining, and marking of the infectivity and/or pathogenecity of H.pylori, more specifically all polynucleic acid sequences comprising the virulence determinant genes vacA and cagA, and even more specifically any conserved region in the cagA gene, said conserved region being defined as more being more than 95% identical between alleles of different H.pylori strains, and most specifically the variable S- and M-regions of the vacA gene. In addition to variable sequences, the S-region of the vacA gene also comprises conserved sequences, which may be chosen as target regions for probes for detection—without typing—of H. pylori according to the present invention.

[0034] The term “probe” refers to single stranded sequence-specific oligonucleotides which have a sequence which is complementary to the target sequence to be detected.

[0035] The term complementary as used herein means that the sequence of the single stranded probe is exactly hybridizing to the sequence of the single-stranded target, with the target being defined as the sequence where the mutation to be detected is located. Since the current application requires the detection of single basepair mismatches, very stringent conditions for hybridization are required, allowing in principle only hybridization of exactly complementary sequences. However, variations are possible in the length of the probes (see below), and it should be noted that, since the central part of the probe is essential for its hybridization characteristics, possible deviations of the probe sequence versus the target sequence may be allowable towards head and tail of the probe, when longer probe sequences are used. These variations, which may be conceived from the common knowledge in the art, should however always be evaluated experimentally, in order to check if they result in equivalent hybridization characteristics compared to the exactly complementary probes.

[0036] Preferably, the probes are about 5 to 50 nucleotides long, more preferably from about 10 to 25 nucleotides. The nucleotides as used in the present invention may be ribonucleotides, deoxyribonucleotides and modified nucleotides such as inosine or nucleotides containing modified groups which do not essentially alter their hybridisation characteristics.

[0037] Probe sequences are represented throughout the specification as single stranded DNA oligonucleotides from the 5′ to the 3′ end. It is obvious to the man skilled in the art that any of the below-specified probes can be used as such, or in their complementary form, or in their RNA form (wherein T is replaced by U).

[0038] The probes according to the invention can be prepared by cloning of recombinant plasmids containing inserts including the corresponding nucleotide sequences, if need be by cleaving the latter out from the cloned plasmids upon using the adequate nucleases and recovering them, e.g. by fractionation according to molecular weight. The probes according to the present invention can also be synthesized chemically, for instance by the conventional phospho-triester method.

[0039] The term “solid support” can refer to any substrate to which an oligonucleotide probe can be coupled, provided that it retains its hybridization characteristics and provided that the background level of hybridization remains low. Usually the solid substrate will be a microtiter plate, a membrane (e.g. nylon or nitrocellulose) or a microsphere (bead) or a chip. Prior to application to the membrane or fixation it may be convenient to modify the nucleic acid probe in order to facilitate fixation or improve the hybridization efficiency. Such modifications may encompass homopolymer tailing, coupling with different reactive groups such as aliphatic groups, NH₂ groups, SH groups, carboxylic groups, or coupling with biotin, haptens or proteins.

[0040] The term “labelled” refers to the use of labelled nucleic acids. Labelling may be carried out by the use of labelled nucleotides incorporated during the polymerase step of the amplification such as illustrated by Saiki et al. (1988) or Bej et al. (1990) or labelled primers, or by any other method known to the person skilled in the art. The nature of the label may be isotopic (³²P, ³⁵S, etc.) or non-isotopic (biotin digoxigenin, etc.).

[0041] The term “primer” refers to a single stranded oligonucleotide sequence capable of acting as a point of initiation for synthesis of a primer extension product which is complementary to the nucleic acid strand to be copied. The length and the sequence of the primer must be such that they allow to prime the synthesis of the extension products. Preferably the primer is about 5-50 nucleotides long. Specific length and sequence will depend on the complexity of the required DNA or RNA targets, as well as on the conditions of primer use such as temperature and ionic strenght.

[0042] The fact that amplification primers do not have to match exactly with the corresponding template sequence to warrant proper amplification is amply documented in the literature (Kwok et al., 1990).

[0043] The amplification method used can be either polymerase chain reaction (PCR; Saiki et al., 1988), ligase chain reaction (LCR; Landgren et al., 1988; Wu & Wallace, 1989; Barany, 1991), nucleic acid sequence-based amplification (NASBA; Guatelli et al., 1990; Compton, 1991), transcription-based amplification system (TAS; Kwoh et al., 1989), strand displacement amplification (SDA; Duck, 1990; Walker et al., 1992) or amplification by means of Qβ replicase (Lizardi et al., 1988; Lomeli et al., 1989) or any other suitable method to amplify nucleic acid molecules known in the art.

[0044] The oligonucleotides used as primers or probes may also comprise nucleotide analogues such as phosphorothiates (Matsukura et a.l, 1987), alkylphosphorothiates (Miller et al., 1979) or peptide nucleic acids (Nielsen et al., 1991; Nielsen et al., 1993) or may contain intercalating agents (Asseline et al., 1984).

[0045] As for most other variations or modifications introduced into the original DNA sequences of the invention, these variations will necessitate adaptations with respect to the conditions under which the oligonucleotide should be used to obtain the required specificity and sensitivity. However the eventual results of hybridisation will be essentially the same as those obtained with the unmodified oligonucleotides.

[0046] The introduction of these modifications may be advantageous in order to positively influence characteristics such as hybridization kinetics, reversibility of the hybrid-formation, biological stability of the oligonucleotide molecules, etc.

[0047] The “sample” may be any biological material taken either directly from the infected huts being (or animal), or after culturing (enrichment), or collected from any other environment. Biological material may be e.g. expectorations of any kind, broncheolavages, blood, skin tissue, biopsies, lymphocyte blood culture material colonies, liquid cultures, soil, faecal samples, urine, surface water, etc.

[0048] The probes of the invention are designed for attaining optimal performance under the same hybridization conditions so that they can be used in sets for simultaneous hybridization; this highly increases the usefulness of these probes and results in a significant gain in time and labour. Evidently, when other hybridization conditions would be preferred, all probes should be adapted accordingly by adding or deleting a number of nucleotides at their extremities. It should be understood that these concommitant adaptations should give rise to essentially the same result, namely that the respective probes still hybridize specifically with the defined target. Such adaptations might also be necessary if the amplified material should be RNA in nature and not DNA as in the case for the NASBA system.

[0049] For designing probes wit desired characteristics, the following useful guidelines known to the person skilled in the art can be applied.

[0050] Because the extent and specificity of hybridization reactions such as those described herein are affected by a number of factors, manipulation of one or more of those factors will determine the exact sensitivity and specificity of a particular probe, whether perfectly complementary to its target or not. The importance and effect of various assay conditions, explained further herein, are known to those skilled in the art.

[0051] First, the stability of the [probe:target] nucleic acid hybrid should be chosen to be compatible with the assay conditions. This may be accomplished by avoiding long AT-rich sequences, by terminating the hybrids with G:C base pairs, and by designing the probe with an appropriate Tm. The beginning and end points of the probe should be chosen so that the length and % GC result in a Tm about 2-10° C. higher than the temperature at which the final assay will be performed. The base composition of the probe is significant because G-C base pairs exhibit greater thermal stability as compared to A-T base pairs due to additional hydrogen bonding. Thus, hybridization involving complementary nucleic acids of higher G-C content will be stable at higher temperatures.

[0052] Conditions such as ionic strength and incubation temperature under which a probe will be used should also be taken into account when designing a probe. It is known that hybridization will increase as the ionic strength of the reaction mixture increases, and that the thermal stability of the hybrids will increase with increasing ionic strength. On the other hand, chemical reagents, such as formamide, urea, DMSO and alcohols, which disrupt hydrogen bonds, will increase the stringency of hybridization. Destabilization of the hydrogen bonds by such reagents can greatly reduce the T_(m). In general optimal hybridization for synthetic oligonucleotide probes of about 10-50 bases in length occurs approximately 5° C. below the melting temperature for a given duplex. Incubation at temperatures below the optimum may allow mismatched base sequences to hybridize and can therefore result in reduced specificity. It is desirable to have probes which hybridize only under conditions of high stringency. Under high stringency conditions only highly complementary nucleic acid hybrids will form; hybrids without a sufficient degree of complementarity will not form. Accordingly, the stringency of the assay conditions determines the amount of complementarity needed between two nucleic acid strands forming a hybrid. The degree of stringency is chosen such as to maximize the difference in stability between the hybrid formed with the target and the nontarget nucleic acid. Second, probes should be positioned so as to minimize the stability of the [probe:nontarget] nucleic acid hybrid. This may be accomplished by minimizing the length of perfect complementarity to non-target organisms, by avoiding GC-rich regions of homology to non-target sequences, and by positioning the probe to span as many destabilizing mismatches as possible. Whether a probe sequence is useful to detect only a specific type of organism depends largely on the thermal stability difference between [probe:target] hybrids and [probe:nontarget] hybrids. In designing probes, the differences in these Tm values should be as large as possible (e.g. at least 2° C. and preferably 5° C.).

[0053] The length of the target nucleic acid sequence and, accordingly, the length of the probe sequence can also be important. In some cases, there may be several sequences from a particular region, varying in location and length, which will yield probes with the desired hybridization characteristics. In other cases, one sequence may be significantly better than another which differs merely by a single base. While it is possible for nucleic acids that are not perfectly complementary to hybridize, the longest stretch of perfectly complementary base sequence will normally primarily determine hybrid stability. While oligonucleotide probes of different lengths and base composition may be used, preferred oligonucleotide probes of this invention are between about 5 to 50 (more particularely 10-25) bases in length and have a sufficient stretch in the sequence which is perfectly complementary to the target nucleic acid sequence.

[0054] Third, regions in the target DNA or RNA which are known to form strong internal structures inhibitory to hybridization are less preferred. Likewise, probes with extensive self-complementarity should be avoided. As explained above, hybridization is the association of two single strands of complementary nucleic acids to form a hydrogen bonded double strand. It is implicit that if one of the two strands is wholly or partially involved in a hybrid that it will be less able to participate in formation of a new hybrid. There can be intramolecular and intermolecular hybrids formed within the molecules of one type of probe if there is sufficient self complementarity. Such structures can be avoided through carefull probe design. By designing a probe so that a substantial portion of the sequence of interest is single stranded, the rate and extent of hybridization may be greatly increased. Computer programs are available to search for this type of interaction. However, in certain instances, it may not be possible to avoid this type of interaction.

[0055] The present invention provides in its most general form a method for the detection and/or typing of Helicobacter pylori (H.pylori) strains present in a sample comprising the steps of:

[0056] (i) if need be releasing, isolating or concentrating the polynucleic acids in the sample;

[0057] (ii) amplifying the polynucleic acids of relevant target regions of the vacA gene and possibly other virulence determinant genes (VDG), with suitable primer pairs, said primers being generally applicable on different H.pylori strains, allowing to amplify said relevant target regions of the VDG preferentially in compatible amplification conditions;

[0058] (iii) hybridizing the polynucleic acids obtained in (i) or (ii) with a set of at least two VDG-derived probes, under appropriate hybridization and wash conditions, and with at least one of said probes hybridizing to a conserved region of a VDG of H.pylori, and with at least one of said probes hybridizing to a variable region of vacA;

[0059] (iv) detecting the hybrids formed in step (iii);

[0060] (v) detecting and/or typing H.pylori strains present in a sample from the differential hybridization signals obtained in step (iv).

[0061] Said typing represents the allele-specific detection of a strain according to the VDG alleles present in that particular H.pylori strain. Said virulence determinant genes represent the genetic elements involved in enabling, determining, and marking of the infectivity and/or pathogenicity of said H.pylori strain. Said method is referred to below as “detection/typing method”.

[0062] The relevant target regions will be derived from polynucleic acid sequences comprising a virulence determinant gene specific of H.pylori, with said relevant target region being either a conserved region in a VDG, or a variable region of a VGD. The relevant target regions of the virulence determinant genes relate either to any conserved region in known VDG, allowing detection of the presence of this VDG in the H.pylori strains in a sample, or to any variable region in known VDG allowing allele-specific typing of the H.pylori present in a sample.

[0063] According to a preferred embodiment of the present invention, step (ii) and (iii) are performed using primers and probes meticulously designed such that they show the desired amplification or hybridization results, when used, if appropriate under compatible amplification or hybridization and wash conditions.

[0064] More specifically, the present invention provides a method for the detection and/or typing of H. pylori strains present in a sample with respect to the development of chronic active gasts and/or gastric and duodenal ulcers and/or gastric adenocarcinomas and/or mucosa-associated lymphoid tissue lymphomas and/or determining eradication therapy.

[0065] The cagA gene and the vacA gene are representatives of the virulence determinant genes of H.pylori. Relevant conserved target regions of alleles of the cagA gene can be used to detect the presence of his gene in H.pylori strains present in a sample. In addition, identified variable regions in alleles of the vacA gene can be used to type in an allel-specific way the respective H.pylori strains. By preference said conserved target regions of alleles of the cagA gene include the region spanning the nucleotide at position 1 to the nucleotide at the position 250 of the open reading frame, with said numbering being according to Genbank accessions L11741 (HECMAJANT) or X70039 (HPCAI); also, by preference the identified variable regions of alleles of the vacA gene include the identified S- and M-region of the vacA gene, said S-region being comprised between the nucleotides at position 1 and 300, said M-region being comprised between the nucleotides at the position 1450 and 1650, with said numbering being according to Genbank accessions UO5676 or U29401.

[0066] Standard hybridization and wash conditions are for instance 2×SSC (Sodium Saline Citrate), 0.1% SDS at 50° C. Other solutions (SSPE (Sodium Saline phosphate EDTA), TMACl (Tetramethyl ammonium Chloride), etc) and temperatures can also be used provided that the specificity and sensitivity of the probes is maintained. If need be, slight modifications of the probes in length or in sequence might have to be carried out in order to maintain the specificity and sensitivity required under the given conditions. Suitable primers can for instance be chosen form a list of primers described below.

[0067] In a more preferential embodiment, the above mentioned polynucleic acids from step (ii) are hybridized with at least two, three, four, five or more of the above mentioned cagA- or vacA-derived probes, which cover respectively a conserved region of the cagA gene and a variable region of the vacA gene.

[0068] Also, in a more preferential embodiment, the above mentioned polynucleic acids from step (i) and (ii) are hybridized with at least one vacA-derived probe directed to at least one identified variable region of the alleles of the vacA gene, by preference including at least one of the vacA-derived probes SEQ ID NO 2 to 11 and 28 to 34.

[0069] It should be stressed that all of the above-mentioned probes, including the allele-specific probes, are contained in the sequence of specific virulence determinant genes of H.pylori, including more particularly the cagA gene or the vacA gene, said probes comprising either a conserved region of the cagA gene, or comprising a variable region of the vacA gene. The probes are preferably designed in such a way that they can all be used simultanously, under the same hybridization and wash conditions. Both criteria imply that preferentially a single amplification and hybridization step is sufficient for the simultanous detection and typing of H.pylori strains present in a sample.

[0070] The present invention relates more particularly to a method as defined above wherein step (ii) consists of amplifying the polynucleic acids of relevant target regions in the vacA and cagA gene with sable sets of primers, said primers being generally applicable on different H. pylori strains, allowing to amplify said relevant target regions in compatible amplification conditions, with said target region being a conserved region in the case of the cagA alleles and a variable region in the case of the vacA alleles, and with said sets of primers being preferentially chosen from the following list of primers as given in Table I: cagF (SEQ ID NO 12) cagR (SEQ ID NO 13) VA1XR (SEQ ID NO 14) VA1F (Atherton et al, 1995) M1F (SEQ ID NO 15) M1R (SEQ ID NO 16) HPMGF (SEQ ID NO 17) HPMGR (SEQ ID NO 18) cagSF (SEQ ID NO 19) cagSR (SEQ ID NO 20) cagEN1 (SEQ ID NO 21) cagRN1 (SEQ ID NO 22) VAMSFb (SEQ ID NO 23) VAMSFc (SEQ ID NO 24) VAMSFd (SEQ ID NO 25) VAMSFe (SEQ ID NO 26)

[0071] or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—,by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide primers, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize/amplify specifically with the same specificity as the oligonucleotide primers from which they are derived.

[0072] Primers cagF and cagR are derived from two published sequences of cagA alleles (Cocacci et al., 1993; Tummuru et al., 1993). The present invention provides novel nucleic acid sequences encoding 149-154 amino acids of the N-terminus of the cagA protein, as disclosed in FIG. 10 (see also example 5). Based on these novel sequences, improved primers were designed for amplification of a relevant target region of the cagA gene. These primers are: cagSF(forward) (SEQ ID NO 19) cagSR(reverse) (SEQ ID NO 20)

[0073] The sequence of these primers is shown in table 1. Study of the alignment of sequences shown in FIG. 10 shows that primers cagSF and cagSR will not hybridize to the polynucleic acids of isolates from East Asia. Therefore, even more improved primers were designed, that will also permit amplification of these sequences. These primers are: cagFN1(forward) (SEQ ID NO 21) cagRN1(reverse) (SEQ ID NO 22)

[0074] The sequence of these primers is shown in table 1. Primers cagSF and cagSR can of course be used when amplification of polynucleic acids of isolates from East Asia is not required. Primers M1F, M1R, HPMGF and HPMGR are based on the sequences of the M-region of the vac A gene, shown in FIGS. 2 and 3, said sequences being provided by the present invention. In a second instance, the present invention discloses additional sequences for the M-region, as shown in FIG. 14 (see example 7). Based on these sequences, improved forward primers were designed that may preferentially be used instead of primer M1F, in combination with reverse primer M1R These primers are: VAMSFb(forward) (SEQ ID NO 23) VAMSFc(forward) (SEQ ID NO 24) VAMSFd(forward) (SEQ ID NO 25) VAMSFe(forward) (SEQ ID NO 26)

[0075] The sequence of these primers is shown in table 1. In order to obtain amplification of polynucleic acids from a maximal number of isolates, primers VAMSFb, VAMSFc, VAMSFd and VAMSFe should be combined in one PCR reaction.

[0076] According to a preferred embodiment, the present invention also relates to a method as defined above wherein step (iii) consists of hybridizing the polynucleic acids obtained in step (ii) with a set of probes, under appropriate hybridization and wash conditions, said set of probes being preferentially applicable in a simultaneous hybridisation assay and comprising at least one probe hybridizing to a conserved region of the cagA gene of H.pylori and at least one probe hybridizing to a variable region of the vacA gene of H.pylori, and more preferentially said set of probes comprising at least one of the following cagA- and vacA-derived probes as defined in Table 2 and in FIGS. 2 to 3: cag A-derived probe(s): cagApro (SEQ ID NO1) cagprobe3 (SEQ ID NO 27) vacA-derived probe(s): P1S1 (SEQ ID NO 2) P22S1a (SEQ ID NO 3) P1S1b (SEQ ID NO 4) P2S1b (SEQ ID NO 5) P1S2(VAS2) (SEQ ID NO 6) P2S2 (SEQ ID NO 7) P1M1 (SEQ ID NO 8) P2M1 (SEQ ID NO 9) P1M2 (SEQ ID NO 10) P2M2 (SEQ ID NO 11) P3S1 (SEQ ID NO 28) P4S1 (SEQ ID NO 29) P1M1new (SEQ ID NO 30) P2M1new (SEQ ID NO 31) P1M2new (SEQ ID NO 32) P2M2new (SEQ ID NO 33) P1M3 (SEQ ID NO 34)

[0077] or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—,by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide probes, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide probes from which they are derived.

[0078] Probe cagApro was derived from published sequences of cagA alleles (Covacci et al., 1993; Tummuru et al., 1993). Based on the above-mentioned novel sequences of the cagA gene (FIG. 10), provided by the present invention, an improved probe was designed:

cagprobe3  (SEQ ID NO 27).

[0079] The sequence of this probe is shown in table 2.

[0080] Probes P1S1, P22S1a, P1S1b, P2S1b, P1S2 and P2S2 are based on the sequences of the S-region of the vacA gene (FIG. 2), provided by the present invention. These probes are designed to recognize sequences of s1a, s1b and s2 variants, respectively. In a second instance, a larger collection of sequences of the S-region of the vacA gene is disclosed by the present invention, as shown in FIG. 12 (see also example 6). Study of the alignment of these novel sequences, as well as phylogenetic analysis (FIG. 13), reveals the existence of a formerly unknown s1 variant, in addition to the known variants s1a and s1b. This formerly unknown variant is disclosed by the present invention and is denoted s1c. The present invention also provides novel probes, that permit specific hybridization to the s1c variant.

[0081] These probe are: P3s1 (SEQ ID NO 28) P4s1. (SEQ ID NO 29)

[0082] The sequence of these probes is shown in table 2.

[0083] Probes P1M1, P2M1, P1M2 and P2M2 are based on the sequences of the M-region of the vacA gene that are provided by the present invention and that are shown in FIG. 3. These probes are designed for specific hybridization to the m1 and m2 variants. Alignment of a larger number of sequences of the M-region, also provided by the present invention, reveals the presence of 3 sequences that are different from the m1 and m2 variants (FIG. 14), as shown in example 7. These sequences may represent a novel variant in the M-region. According to the present invention, this variant is denoted m3. Based on the sequences of the M-region that are shown in FIG. 14, novel probes have been designed, these probes being: P1M1new (SEQ ID NO 30) P2M1new (SEQ ID NO 31) P1M2new (SEQ ID NO 32) P2M2new (SEQ ID NO 33)

[0084] Probes P1M1new and P2M1new improve upon probes P1M1 and P2M1 in that they are capable, when used together, to specifically hybridize to all m1 sequences shown in FIG. 14. Likewise, probes P1M2new and P2M2new are improved probes that specifically hybridize to all m2 sequences shown in FIG. 14. In addition, a novel probe that specifically hybridizes to the aforementioned m3 sequences, is provided. This probe is:

P1M3  (SEQ ID NO 34).

[0085] The sequences of probes P1M1new, P2M1new, P1M2new, P2M2new and P1M3 are shown in table 2.

[0086] According to another embodiment, the present invention relates to a method for the detection of H.pylori strains present in a sample comprising the steps of:

[0087] (i) if need be releasing, isolating or concentrating the polynucleic acids in the sample;

[0088] (ii) amplifying the polynucleic acids of a relevant target region of the vacA gene with a suitable primer pair, said primer pair being generally applicable on different H.pylori strains, allowing to amplify said relevant target region of the vacA gene preferentially in compatible amplification conditions;

[0089] (iii) hybridizing the polynucleic acids obtained in (i) or (ii) with at least one probe hybridizing to a conserved region of the vacA gene:

[0090] (iv) detecting the hybrids formed in step (iii);

[0091] (v) determining the presence or absence of H.pylori in a sample from the hybridization signals obtained in step (iv).

[0092] Said method is referred to below as the “detection method”.

[0093] According to a preferred embodiment, the present invention relates to a method according to the preceding embodiment, wherein step (ii) consists of amplifying the polynucleic acids of a relevant target region in the vacA gene with suitable primers, said primers being generally applicable on different H. pylori strains, allowing to amplify said relevant target region in compatible amplification conditions, with said target region being a conserved region, with said primers preferentially being VA1F and VA1XR (SEQ ID NO14), or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—,by others (including modified nucleotides such as inosine), or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize/amplify specifically with the same specificity as the oligonucleotide primers from which they are derived.

[0094] According to an even more preferred embodiment, the present invention relates to a method according to any of the two preceding embodiments, wherein step (iii) casts of hybridizing the polynucleic acids obtained in step (ii) with a set of probes, under appropriate hybridization and wash conditions, said set of probes being preferentially applicable in a simultaneous hybridisation assay and comprising at least one probe hybridizing to a conserved region of the vacA gene of H.pylori, and more preferentially said set of probes comprising at least one of the following vacA-derived probes: HpdiaS1 (SEQ ID NO 35) HpdiaS2 (SEQ ID NO 36) HpdiaS3 (SEQ ID NO 37) HpdiaS4 (SEQ ID NO 38) HpdiaS5 (SEQ ID NO 39)

[0095] or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—,by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide probes, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide probes from which they are derived.

[0096] According to another embodiment, the present invention relates to a probe composition for use in any detection/typing method as defined above, said composition comprising at least one probe hybridizing to a conserved region of a VDG of H.pylori, and at least one probe hybridizing to a variable region of vacA, and more preferentially said probes being derived from the polynucleic acid sequences of the vacA and/or cagA gene of H.pylori, and most preferentially said probes being chosen from SEQ ID NO 1 to 11 and 27 to 34, or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—,by others (icluding modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide probes, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide probes from which they are derived.

[0097] According to another embodiment, the present invention relates to a probe composition for use in any detection method as defined above, said composition comprising at least one probe hybridizing to a conserved region of the vacA gene of H.pylori, and most preferentially said probe being chosen from SEQ ID NO 35 to 39, or sequence variants thereof with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—,by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide probes, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides; all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide probes from which they are derived.

[0098] According to another embodiment, the present invention relates to a composition comprising at least one suitable oligonucleotide amplification primer, allowing to amplify the polynucleic acids of the relevant target regions of the respective VDG, said suitable primers being generally applicable with different H.pylori strains and allowing the amplification of said relevant target regions to be used in compatible amplification conditions, and more preferentially said primers allowing the amplification of a conserved region of the cagA gene and a region of the vacA gene comprising conserved and/or variable target regions, and most preferentially said primers being selected from SEQ ID NO 12 to 26, or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—,by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide primers, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide primers from which they are derived.

[0099] According to an even more specific embodiment, the present invention relates to a probe being derived from the polynucleic acid sequences of the vacA and/or cagA gene of H.pylori, and with said probe being chosen from SEQ ID NO 1 to 11 and 27 to 39, or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—,by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide probes, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide probes from which they are derived.

[0100] According to yet another even more preferred embodiment, the present invention relates to an oligonucleotide amplification primer allowing the amplification of a region of the cagA gene or a region of the vacA gene of H.pylori, and with said primer being selected from SEQ ID NO 12 to 26, or sequence variants thereof with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—,by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide primers, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize/amplify specifically with the same specificity as the oligonucleotide primers from which they are derived.

[0101] According to another embodiment, the present invention relates to a method as defined above for the detection and/or typing of alleles of VDG of H.pylori, more preferentially alleles of the cagA and vacA gene of H.pylori, present in a sample using a set of probes and/or primers specially designed to detect and/or to amplify and/or to type the said alleles, with said probes and primers being defined above.

[0102] According to another embodiment, the present invention relates to a method as defined above for the detection of alleles of VDG of H.pylori, more preferentially alleles of the vacA gene of H.pylori, present in a sample using a set of probes and/or primers specially designed to detect and/or to amplify the said alleles, with said probes and primers being defined above.

[0103] In order to detect and/or type the H.pylori strains present in the sample, using the above set of oligonucleotide probes, any hybridization method known in the art can be used (conventional dot-blot, Southern blot, sandwich, chip-based, etc). In order to obtain fast and easy results if a large number of probes is involved, a reverse hybridization format may be most convenient. According to a preferred embodiment, a selected set of probes are immobilized onto a solid support.

[0104] In another preferred embodiment, a selected set of probes are immobilized to a membrane strip. Said probes may be immobilized individually or as mixtures on the solid support.

[0105] A specific and very user-friendly embodiment of the above-mentioned preferential method is the LiPA-method, where the above-mentioned set of probes is immobilized in parallel lines on a membrane, as further described in the examples.

[0106] Alternatively, detection—without typing—of H. pylori strains may be performed conveniently by use of the DNA Enzyme Immuno Assay (DEIA). The principle of this assay as well as an application based on the detection of a conserved part of the S-region of the vacA gene is outlined in example 8.

[0107] Some of the above described probes are directed towards nucleic acid sequences already described in the prior art. However, as illustrated in the examples, nucleic acid sequences of VDG of a large number of new isolates of H.pylori were disclosed for the first time in this invention, providing valuable new information necessarry to succesfully design suitable probes with respect to detecting and more importantly to typing H.pylori strains. These new H. pylori sequences also form part of the present invention.

[0108] Moreover, previously designed primers and probes by other autors (Atherton et al., 1995) are shown in the examples to be less appropriate in typing H.pylori strains in a sample.

[0109] This invention also provides for probes and primers(sets) which are designed to specifically detect or amplify the respective VDG alleles of the new isolates, and provides moreover methods and kits for applying said primers or probes in the detection and/or typing of H.pylori strains in a sample.

[0110] The present invention also provides for a set of primers, allowing amplification of the conserved region spanning the region between the nucleotide at position 1 to the nucleotide at position 250 of the cag gene of H.pylori. The set of primers comprises for instance:

cagF and cagR  (SEQ ID N^(o) 11 and 12)

[0111] Also, the present invention provides sets of primers covering the variable S- and/or M-regions of the vacA gene of H.pylori, said S-region being comprised between the nucleotide at position 1 and 300 and comprising conserved sequences in addition to variable sequences, said M-region being comprised between the nucleotides at the position 1450 and 1650, with said primers for instance being:

VA1-F and VA1-XR  (Atherton et al., 1995 and SEQ ID N^(o) 15)

M1F and M1R  (SEQ ID N^(o) 16 and 17)

[0112] The invention also provides methods and kits to apply the above described primers sets directed to particular regions of VDG genes, e.g the cagA and vacA genes, simultaneously under identical amplification, hybridisation and washing conditions.

[0113] The primers according to the present invention may be labeled with a label of choice (e.g. biotine). Different target amplification systems may be used, and preferentially PCR-amplification, as set out in the examples. Single-round or nested PCR may be used.

[0114] According to yet another embodiment, the present invention relates to a solid support, preferentially a membrane strip, carrying on its surface, at least one probe as defined above.

[0115] According to another embodiment, the present invention relates to a kit for detecting and/or typing H. pylori strains in a sample liable to contain it, comprising the following components:

[0116] when appropriate at least one oligonucleotide primer as defined;

[0117] at least one probe as defined above, with said probe and/or other probes applied being by preference immobilized on a solid support

[0118] a buffer or components necessary to produce the buffer enabling an amplification or a hybridization reaction between these probes and the amplified products;

[0119] when appropriate a means for detecting the hybrids resulting from the preceding hybridization.

[0120] The term “hybridization buffer” means a buffer enabling a hybridization reaction to occur between the probes and the polynucleic acids present in the sample, or the amplified products, under the appropriate stringency conditions.

[0121] The term “washing solution” means a solution enabling washing of the hybrids formed under the appropriate stringency conditions.

[0122] The present invention also relates to isolated vacA polynucleic acid sequences defined by SEQ ID NO 40 to 91 and SEQ ID NO 115 to 276 or any fragment thereof that can be used as a primer or as a probe in a method for detection and/or typing of one or more vacA alleles of H. pylori.

[0123] The present invention also relates to isolated cagA polynucleic acid sequences defined by SEQ ID NO 92 to 114 or any fragment thereof, that can be used as a primer or as a probe in a method for detection and/or typing of one or more cagA alleles of H. pylori.

[0124] The present invention also relates to a vacA protein fragment encoded by any of the nucleic acids with SEQ ID NO 40 to 91 and SEQ ID NO 115 to 276 or any subfragment of said vacA protein fragment, with said subfragment consisting of at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25 contiguous ammo acids of a vacA protein.

[0125] The present invention also relates to a cagA protein fragment encoded by any of the nucleic acids with SEQ ID NO 92 to 114, or any subfragment of said cagA protein fragment, with said subfragment consisting of at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25 contiguous amino acids of a cagA protein.

[0126] The following examples serve to illustrate the present invention and are in no way to be construed as lining the scope of this invention. It should also be noted that the contents of all references referred to in this invention are hereby incorporated by reference.

LEGENDS TO THE FIGURES

[0127]FIG. 1: Schematic overview of the S- and M-region of the vacA gene of H.pylori and indication of the overall position of the relevant primers.

[0128]FIG. 2a: DNA sequence alignment of the S-region S1a/b of various H.pylori strains.

[0129]FIG. 2b: DNA sequence alignment of the S-region S2 of various H.pylori strains.

[0130]FIG. 3a: DNA sequence alignment of the M-region M1 of various H.pylori strains.

[0131]FIG. 3b: DNA sequence alignment of the M-region M2 of various H.pylori strains.

[0132]FIG. 4: Agarose gel-electrophoresis of the amplification products using as starting material DNA from the gastric biopsy 18 and primers indicated in example 1.

[0133]FIG. 5: Agarose gel-electrophoresis of the amplification products using as starting material DNA from the gastric biopsy 41 and primers indicated in example 1.

[0134]FIG. 6: Agarose gel-electrophoresis of the amplification products using as starting material DNA from the gastric biopsy F67 and primers indicated in example 1.

[0135]FIG. 7: Agarose gel-electrophoresis of the amplification products using as starting material DNA from the gastric biopsy 25 and primers indicated in example 1.

[0136]FIG. 8: LIPA outline where the probes indicated in the figure are according to table II and primers according to example 3.

[0137]FIG. 9: Multiplex PCR with vacA as well as cagA primers. For vacA primer set G was used (FIG. 1); for capA primers cagF and cagR were used. The isolate shown in the first two lanes contains s1 and m1 alleles and is cagA+. The isolate shown in lanes 4 and 5 (counting from left) contains a multiple infection.

[0138]FIG. 10: Alignment of cagA nucleic acid sequences, encoding the N-terminus of the cagA protein. The position of some cagA primers is indicated. Hyphens indicate gaps introduced to obtain optimal alignment. Asterisks below the alignment indicate identical nucleotides. Dots below the alignment indicate partial conservation.

[0139]FIG. 11: Phylogenetic tree of cagA amino acid sequences. The 16 sequences counting from the top represent the first variant, occurring mainly in Europe and in Australia. USA123 and USA39 are strains from the USA, having an intermediate position. The 7 sequences counting from the bottom (HK7 to HKTh8828) represent a variant that is mainly found in Far East Asia.

[0140]FIG. 12: Alignment of nucleic acid sequences of part of the S-region of the vacA gene. The sequences are grouped according to the variant that they belong to. A larger number of sequences is shown than in FIGS. 2a and 2 b. The variants are from top to bottom: s2, s1c, s1b and s1a. Hyphens indicate that at that position the nucleotide is identical to that in the sequence of strain 29401. Dots indicate a gap in the sequence that was introduced to preserve alignment.

[0141]FIG. 13: Phylogenetic analysis of nucleic acid sequences of part of the S-region of the vacA protein. The variants are indicated.

[0142]FIG. 14: Alignment of nucleic acid sequences of part of the M-region of the vacA gene. A larger number of sequences is shown than in FIGS. 3a and 3 b. Hyphens and dots as in FIG. 12.

[0143]FIG. 15: Phylogenetic analysis of nucleic acid sequences of part of the M-region of the vacA protein. The variants are indicated.

EXAMPLES Example 1 Evaluation of the Use of the Primers Described by Atherton et al., 1995 in Typing H.pylori Strains Within the Framework of Large Scale Clinical Trials

[0144] 1.1 Comparison of VacA Genotyping Methods.

[0145] The efficiency of the vacA genotyping as described by Atherton et al. (1995) was compared to the efficacy as described in the present invention. The method as described by Atherton comprises 6 different PCR reactions:

[0146] A. Using primers VA1F and VA1R, to distinguish s1 and s2 alleles.

[0147] B. Using primers SS1F and VA1R, to amplify s1a sequences.

[0148] C. Using primers SS3F and VA1R, to amplify s1b sequences.

[0149] D. Using primers SS2F and VA1R, to amplify s2 sequences.

[0150] E. Using primers VA3F and VA3R, to amplify m1 sequences.

[0151] F. Using primers VA4F and VA4R, to amplify m2 sequences.

[0152]FIG. 1 shows a schematic representation of all primers involved in vacA typing. Identification of the PCR products is based on visual inspection of DNA bands on an agarose gel.

[0153] 1.2 Problems with the Atherton System:

[0154] The s-region:

[0155] Based on the sequence alignments from European isolates, as shown in FIGS. 2a and 2 b, it is clear that primers SS1F, SS2F, and SS3F may contain several mismatches to their respective target sequences. This may hamper proper annealing of the primers and may lead to amplification of spurious bands. The target sequence for primer SS3F (aimed at detection of the s1b allele), contains two crucial mismatches at the 3′ end of the primer in some isolates (e.g. in isolates F67, F68, F73, F76, F42, F12).

[0156] F67 (see below) showed amplification with primer SS1F and VA1R, whereas amplification with SS3F and VA1R was negative, suggesting the presence of the s1a genotype. However, PCR/LiPA analysis showed the presence of genotype s1a, which was confirmed by sequence analysis.

[0157] Primer SS2F, aimed at s2 sequences, results in amplification of aspecific bands (see e.g. Figure photo 1 & 2, in case of primerset D).

[0158] The m-region:

[0159] As described by Atherton et al., (1995) typing of the m-region was initially based on hybridization with two specific DNA probes, i.e. pCTB4 and VA6 for the M1 and M2 variant, respectively.

[0160] From the published nucleotide alignments of the vacA sequences from strain 60190 (type M1) and Tx30a (U29401; type M2), it is obvious that these two probes cover a region of substantial variation.

[0161] Moreover, the M1 variant shows a deletion (around position 2340 of the 60190 sequence), compared to the M2 variant. One might envisage that this region of deletion/insertion (similar to the S-region) is of major importance to discriminate M1 and M2. However, the PCR primers for specific detection of M1 and M2 are aimed at a different region of the vacA gene, which is more downstream (between positions 2750 and 3030 of the 60190 sequence) and is not covered by the original DNA probes.

[0162] We have analysed the individual PCR primers by sequence alignments to the M1 and M2 sequences. We noticed that the 3′ ends of several primers described by Atherton et al., are not completely unique in the vacA gene.

[0163] Primer VA3-R shows homology to sequences:

[0164] in strain 60190 (Genbank Seq U05676; m1-type):

[0165] around pos 229 (6 nt at the 3′ end)

[0166] around pos 839 (6 nt at the 3′ end)

[0167] around pos 3011 (target sequence, 100%)

[0168] around pos 4653 (6 nt at the 3′ end)

[0169] in strain Tx30a:

[0170] around pos 4271 (6 nt at the 3′ end)

[0171] Primer VA4-F shows homology to sequences:

[0172] in strain Tx30a (GenBank Seq #U2940 1; m2-type):

[0173] around pos 231 (7 nt at the 3′ end)

[0174] around pos 1907 (8 nt at the 3′ end)

[0175] around pos 2297 (target sequence, 100%)

[0176] around pos 2594 (9 nt at the 3′ end)

[0177] Especially the homologies at the very 3′ end may hamper the specificity of these primers. Some spurious bands were obtained when using these primers. Moreover, these primers failed to yield any amplification product in several isolates or biopsies (e.g., biopsy 41, see below). This has been observed before (Maeda, S, K. Ogura, M. Ishitom F. Kanai, H. Yoshida, S. Ota, Y. Shiratori, and M. Omata, abstract #492: Diversity of Helicobacter pylori vacA gene in Japanese strains—high cytotoxin activity type s1 is dominate in Japan, Digestive Disease Week San Fransisco, May 1996).

[0178] We have analysed the M1 and M2 region of the vacA allele from multiple H.pylori strains by DNA sequencing upon PCR amplification using as primers HPMGF and HPMGR (see FIGS. 3a and 3 b). Based on these sequences new primers had to be developed for vacA genotyping in a multiplex PCR as described in example 4.

[0179] 1.3. Comparative Results Are Shown in Figures and Discussed Below:

[0180] The respective primers used by Atherton et al. (1995) were used in A-F while primerset G, comprised the newly designed set of primers comprising VA1F, VA1XR, M1F, and M1R disclosed for the first time in this invention. All of these primers are new as such except VA1F which was disclosed by Atherton et al., 1995. Biopsy #18 (see FIG. 4) A s1/s2 S1 B s1a + C s1b − D s2 − (note the background) E m1 − F m2 + G multi s1/m2

[0181] From this biopsy, the expected fragments were amplified, consistent with a s1a/m2 genotype. Multiplex PCR, followed by LiPA, as described in the present invention, yielded an identical result. Biopsy #41 (see FIG. 5) A s1/s2 s1 B s1a + C s1b − D s2 − (note the background) E m1 − F m2 − G multi s1/m1

[0182] From this biopsy, only the s-region could be typed by the method of Atherton et al. Amplification with the m1 and m2-specific primers did not yield any visible DNA product. However, by the multiplex PCR followed by LiPA, as described in the present invention, a s1a, m1 genotype was detected. Isolate F67 (see FIG. 6) A s1/s2 s1 B s1a − C s1b − D s2 − (note the background) E m1 − F m2 − C multi s1/m1

[0183] LiPA showed the presence of a s1b, instead of s1a. This was confirmed by sequence analysis

[0184] 1.4 Detection of Multiple Infections: Biopsy 25 (see FIG. 7) A s1/s2 both B s1a − C s1b + D s2 + (note the background) E m1 + F m2 + G multi s1/s2/m1/m2

[0185] LiPA analysis revealed the presence of s1b/s2/m1/m2mixed genotypes.

Example 2 Identification and Amplification of a Conserved Region of the cagA Gene Fragment in H.pylori; Designing Primers and a cagA-derived Probe Allowing to Detect H.pylori in a Sample Through Reverse Hybridization

[0186] The establishment of the experimental conditions in order to set up a reverse hybridisation assay in case of the cagA gene comprised i) a theoretical evaluation of suitable probes and primers based upon nucleic acid sequence comparisons using standard DNA analysing computer programmes, and ii) an experimental evaluation and adjustment of the primers and probes to the conditions set for the reverse hybridization technology.

[0187] Comparison of two published nucleic acid sequences of cagA alleles of different H.pylori strains demonstrated that the region between nucleic acids 17 to 113 is highly conserved (Covacci et al., 1993; Tummuru et al., 1993) and said region could be used for positive identification of the presence of the cagA gene in a certain H.pylori strain.

[0188] A set of primers was designed as follows:

[0189] cagF (bp 17 to 40)

[0190] cagR (bp 178-199)

[0191] Both primers are new primer sequences, described by the current invention (see table I). These primers can be labeled with a label of choice (e.g. biotine). Different primer-based target-amplification systems may be used. For amplification using the PCR, the conditions used in case of a single-round amplification with above primers cagF and cagR, involve 40 cycles of 1 min/95° C., 1 min/55° C., 1 min/72° C. followed by a final extension for 5 min at 72° C.

[0192] The PCR reaction mixture was as follows:

[0193] 1 μl DNA sample containing H.pylori or control DNA

[0194] 10 μl 10×polymerase mix (final concentration 10 mM Tris HCl, pH 9.0, 50 mM KCl, 2.5 mM MgCl₂, 0.01% gelatin, and 0.1% Triton)

[0195] 20 μl deoxyribonucleotide mix (final concentration 200 μM each)

[0196] 1 μl Super Taq polymerase (0.25 U/μl)

[0197] 1 μl forward primer (50 pmoles/μl)

[0198] 1 μl reverse primer (50 pmoles/μl)

[0199] 66 μl water

[0200] 100 μl

[0201] Amplification products were analysed on an agarose gel stained with ethidiumbromide and visualized under UV. The amplified product obtained using 1 μl of H.pylori DNA as starting material and above primers consisted of a single band with approximatively molecular weight 0.18 Kb, in agreement with the expected size of 183 bp. Control samples containing DNA from cag(−) H.pylori strains or other bacterial species did not yield any amplification product (data not shown).

[0202] The uniformity of the amplified product was verified through DNA sequencing applying standard sequencing techniques. A 100% match with the described region could be demonstrated (data not shown).

[0203] Also, a number of probes were tested in order to determine optimal hybridization between the above amplified product and the said probes under standardized hybridization and washing conditions applied in the reverse hybridisation assay.

[0204] The below probes tested were chosen from the list indicated in table II. Said probes were immobilized onto a solid support as described in example 3. The amplified product obtained with said above primers was hybridized to the respective probes applying the same conditions as outlined in example 3. Most optimal results were obtained with probe cagapro (SEQ ID N^(o)12), which can thus be used as a positive identification of the presence of the cagA gene in H. pylori strains, in combination with the above primers under the conditions of the below reverse hybridization assay.

Example 3 Identification and Amplification of Variable Target Regions of the VacA Gene in H.pylori; Designing Primers and a VacA-Derived Probe Allowing to Detect and/or Type H.pylori in a Sample Through Reverse Hybridization

[0205] The establishment of the experimental conditions in order to set up a reverse hybridisation assay in case of the vacA gene comprised i) a theoretical evaluation of suitable probes and primers based upon nucleic acid sequence comparisons using standard DNA analysing computer programmes, and ii) an experimental amplification of the various variable regions, DNA sequence analysis of the respective amplified fragments, designing allele-specific probes and appropriate primers, and the evaluation and the adjustment of the primers and probes to the conditions applicable in the reverse hybridization technology.

[0206] Recently, Atherton et al. (1995) demonstrated the presence of two variable regions in the vacA gene, being the S- and M-region. Primers were designed in order to amplificate specifically alleles of the vacA with variable S- and M-regions.

[0207] In this invention, a large number of additional nucleic acid sequences spanning both the said S- or M-region were obtained upon DNA sequence analysis of PCR amplification of said regions. These data are new and are being disclosed here for the first time in the present invention (see FIGS. 2 and 3).

[0208] In order to obtain amplification products spanning either the S- or the M-region of the vacA gene. the following set of primers was used: S-region: VA1-F (see Atherton et al, 1995) VA1-R (see Atherton et al, 1995) M-region: HPMGF (CACAGCCACTTTCAATAACGA) HPMGR (CGTCAAAATAATTCCAAGGG)

[0209] These primers can be labeled with a label of choice (in this case biotine was used). Different primer-based target-amplification systems may be used. For amplification using the PCR, the conditions used in case of a single-round amplification with above primers, involve 40 cycles of 1 min/95° C., 1 min/55° C. 1 min/72° C. followed bad a final extension for 5 min at 72° C.

[0210] The PCR reaction mixture was as follows:

[0211] 1 μl DNA sample, containing H.pylori or control DNA

[0212] 10 μl 10×polymerase mix (final concentration 10 mM Tris HCl, pH 9.0, 50 mM KCl, 2.5 mM MgCl₂, 0.01% gelatin, and 0.1% Triton)

[0213] 20 μl deoxyribonucleotide mix (final concentration 200 μM each)

[0214] 1 μl Super Taq polymerase (0.25 U/μl)

[0215] 1 μl forward primer (50 pmoles/μl)

[0216] 1 μl reverse primer (50 pmoles/μl)

[0217] 66 μl water

[0218] 10 μl

[0219] Amplification products were analysed by DNA sequencing applying standard sequencing techniques. The results of these analyses are given in FIGS. 2 and 3. Based on these analyses, it became obvious that primers being used by others with the aim of allel-specific typing of H.pylori based upon the variable S- and M-region of the vacA gene, could not cover the full range of pathogenic H.pylori strains (see example 1). Thus, new sets of primers, not obvious to the skilled man in the art, were designed in order to develop an assay to detect and type pathogenic H.pylori strains in a sample. The primers and their sequence are given in table I. Also, a number of probes were tested in order to obtain optimal hybridization between the amplified products, generated by the new primer sets, and said probes under standardized hybridization and washing conditions applied in the reverse hybridisation assay. The tested probes are given in table II. Said probes were immobilized unto a solid support as described by Styver et al., 1993. The amplification with said primersets was performed under the conditions and protocol as described above in this example. The amplified products obtained with said above primers were hybridized to the respective probes (see FIG. 8).

[0220] Optimal results were obtained combining the following primers: VA1-F (Atherton et al., 1995) VA1XR (SEQ ID NO 14) M1F (SEQ ID NO 15) M1R (SEQ ID NO 16)

[0221] with the following probes: P1S1 (SEQ ID NO 2) P22S1a (SEQ ID NO 3) P1S1b (SEQ ID NO 4) P2S1b (SEQ ID NO 5) P1S2(VAS2) (SEQ ID NO 6) P2S2 (SEQ ID NO 7) P1M1 (SEQ ID NO 8) P2M1 (SEQ ID NO 9) P1M2 (SEQ ID NO 10) P2M2 (SEQ ID NO 11)

Example 4 Development of the Line Probe Assay (LiPA)-Strip

[0222] The principle and protocol of the line probe assay was in essence as described earlier (Stuyver et al., 1993). Good results were obtained combining the following primers: cagF (SEQ ID NO12) cagR (SEQ ID NO13) VA1-F (Atheron et al., 1995) VA1XR (SEQ ID NO14) M1F (SEQ ID NO15) M1R (SEQ ID NO16)

[0223] with the following probes: cagApro (SEQ ID NO 1) P1S1 (SEQ ID NO 2) P22S1a (SEQ ID NO 3) P1S1b (SEQ ID NO 4) P2S1b (SEQ ID NO 5) P1S2(VAS2) (SEQ ID NO 6) P2S2 (SEQ ID NO 7) P1M1 (SEQ ID NO 8) P2M1 (SEQ ID NO 9) P1M2 (SEQ ID NO 10) P2M2 (SEQ ID NO 11)

[0224] The said primers were labeled with biotine. Different primer-based target-amplification systems may be used. For amplification using the PCR, the conditions used in case of a single-round amplification with above primers, involve 40 cycles of 1 min/95° C., 1 min/55° C., 1 min/72° C. followed by a final extension for 5 min at 72° C.

[0225] The PCR reaction mixture was as follows:

[0226] 1 μl DNA sample, containing H.pylori or control DNA

[0227] 10 μl 10×polymerase mix (final concentration 10 mM Tris HCl pH 9.0, 50 mM KCl, 2.5 mM MgCl₂, 0.01% gelatin, and 0.1% Triton)

[0228] 20 μl deoxyribonucleotide mix (final concentration 200 μM each)

[0229] 1 μl Super Taq polymerase (0.25 U/μl)

[0230] 1 μl forward primer (50 pmoles/μl)

[0231] 1 μl reverse primer (50 pmoles/μl)

[0232] 66 μl water

[0233] 100 μl

[0234] The sequence of these primers is given in table 2. An example of the amplification products generated by use of vacA s/m region-primers or cagA-primers is shown in FIG. 9. For this experiment primer set G (FIG. 1) was used for vacA and cagF and cagR were used for cagA. The isolate shown in the first two lanes contains s1 and m1 alleles and is cagA+. The isolate shown in lanes 4 and 5 (counting from left) contains a multiple infection. The results of the LiPA are shown in FIG. 8.

Example 5 Novel DNA Sequences of a Fragment of the cagA Gene of H. pylori and Design of Primers and a Probe Based Thereon

[0235] The 5′ part of the cagA gene was amplified by PCR from various H. pylori isolates, using different primer combinations. The resulting fragments were sequenced and the alignment is shown in FIG. 10. The sequences comprised 449-464 bp, starting at the start codon of the ORF. A total of 149-154 amino acids representing the N-terminus of the cagA protein can be derived by translation of these sequences, starting at the ATG codon at position 1 in FIG. 10.

[0236] As shown by phylogenetic analysis in FIG. 11, 2 different forms of capA were recognized The first variant is highly homologous to the reference sequence (Genbank accession number L11741 (HECMAJANT) or X70039 (HPCAI)) and occurs mainly in strains from Europe and Australia. Two sequences from the USA (J123 and J39) seem to have intermediate positions in the phylogenetic tree. The second variant, mainly found in strains from Far East Asia. contains 15 additional nucleotides between nt positions 20 and 31, encoding 5 additional amino acids between positions 8-9, as compared to the reference sequence.

[0237] From the nucleotide sequence alignment the following novel primers and probe were deduced, aimed at highly conserved regions in the cagA gene. TABLE 3 CagA primers and probe position/ primer/probe 5′ to 3′ sequence orientation¹ primers cagFN1 GATAAGAAYGATAGGGATAA +(142-161) cagRN1 AATACTGATTCTTTTTGG −(230-247) probe cagprobe3 GGATTTTTGATCGCTTTATT −(219-227)

Example 6 Novel DNA Sequences of the s-region of the VacA Gene of H. pylori and Design of Probes Based Thereon

[0238] VacA s-region fragments were amplified from a large number of H. pylori isolates, using primers VA1-F and VA1-R (Atherton et al., 1995). This resulted in fragments of 176 bp for s1 and 203 bp for s2 types sequences. Parts of these fragments were sequenced, and the resulting alignment of 80 sequences (including 2 reference sequences U29401 and U07145) is shown in FIG. 12. Apart from the already known s1a and s1b type sequences, a third variant was observed, mainly in isolates from Far East Asia (Japan, China, Hong Kong). This variant is designated s1c. Type s1c has several highly consistent mutations as compared to type s1b and s1a. These mutations allow specific recognition of each of the s1subtypes.

[0239] Phylogenetic analysis, as shown in FIG. 13, reveals distinct clusters of s1a, s1b, s1c and s2 sequences. The N-terminal parts of the vacA protein can be deduced from the nucleic acid sequences of the s1a, s1b, s1c, and s2 variants by translation starting at codon CCT at position 2 in FIG. 12. This reveals the presence of a single conserved amino acid mutation (Lys) at position 22 in subtype s1c as compared to s1a and/or s1b sequences. All other nucleotide mutations appear to be silent.

[0240] New probes were designed to specifically detect the s1c variants: P3s1: 5′ GGGYTATTGGTYAGCATCAC 3′ (positions 26-45) P4s1: 5′ GCTTTAGTRGGGYTATTGGT 3′ (positions 17-36)

[0241] Thus, for optimal detection of the vacA s-region variants, the following probes were used: for s1a: P1S1 and P22S1a for s1b: P1s1b and P2s1b for s2: P1S2 (VAS2) and P2S2 for s1c: P3s1 and P4s1

Example 7 Novel DNA Sequences of the m-region of the VacA Gene of H. pylori and Design of Probes Based Thereon

[0242] The vacA m-region was analyzed from a number of H. pylori isolates, by using primers HPMGF and HPMGR. These primers allow general amplification of larger parts of the m-region sequences and generate fragments of 401 and 476 bp for m1 and m2 variants, respectively. Fragments were sequenced and the alignment of 86 m-region sequences (including reference sequences U05677, U07145, U05676 and U29401) is shown in FIG. 14. The phylogenetic tree is shown in FIG. 15. The alignment revealed the presence of 3 sequences (Ch4, Hk41, Hk46) that are different from the published m1 and m2 variants. These sequences may represent another variant in the m-region. Said new variant may be denoted m3.

[0243] These alignments revealed that the target sequence for forward primer M1F (SEQ ID NO15) was not completely conserved among all isolates. The target sequence for reverse primer M1R appeared highly conserved among all isolates. As an alternative for forward primer M1F the following primers were designed, as shown in table 4. TABLE 4 Novel forward primers for the vacA m-region primer sequence 5′ to 3′ orientation VAMSFb: GTGGATGCCCATACGGCTAA forward VAMSFc GTGGATGCTCATACAGCTWA forward VAMSFd GTGGATGCCCATACGATCAA forward VAMSFe GCGAGCGCTCATACGGTCAA forward

[0244] PCR amplification in the m-region of the vacA gene can thus be performed by use of VAMSFb,c,d, and e as forward primers, and M1R as the reverse primer.

[0245] Novel probes were designed for specific hybridization to m1 and m2 variants. Their sequence is based on the above-mentioned probes P1m1 , P2m1, P1m2 and P2m2. In order to obtain reactivity with all sequences, a few degeneracies were included. The novel sequences are shown in table 5. For specific identification of m3 variants, a single probe is added (P1m3). TABLE 5 Novel probes for the vacA m-region probe sequence 5′-3′ positions P1m1new TTGATACKGGTAATGGTGG as for P1m1 P2m1new KGGTAATGGTGGTTTCAACA as for P2m1 P1m2new KGGTAATGGTGGTTTCAACA as for P1m2 P2n2new AGAGCGATAAYGGKCTAAACA as for P2m2 P1m3 AGGGTAGAAATGGTATCGACA 1577-1597¹

Example 8 Detection of H. pylori DNA by PCR and DNA Enzyme Immuno Assay (DEIA)

[0246] This method is used for rapid and specific detection of PCR products. PCR products are generated by a primerset, of which either the forward or the reverse primer contain biotin at the 5′ end. This allows binding of the biotinylated amplimers to streptavidin-coated microtiter wells. PCR products are denatured by sodium hydroxide, which allows removal of the non-biotinylated strand. Specific digoxigenin(DIG)-labelled oligonucleotide probes are hybridized to the single-stranded immobilized PCR product and hybrids are detected by enzyme-labelled conjugate and colorimetric methods.

[0247] For detection of H. pylori DNA, the vacA s-region is used as a target. PCR primers VA1F and biotinylated VA1XR are used for PCR of the vacA s-region. A multiplex PCR can be performed on the vacA s and m-regions. The result of PCR is then tested by the DEIA, using probes aimed at the s-region. In case of a positive result the same PCR mixture, including amplimers from both the vacA s- and m-regions, can subsequently be used on a vacA LIPA.

[0248] The PCR mixtures can be composed as follows: 1 μl target DNA 5 μl 10 × PCR buffer (final concentration 10 mM TrisHCl pH 8.3, 50 mM KCl, 1-3 mM MgCl₂) 10 μl 5 × dNTP's (1 mM) 0.3 μl AmpliTaq Gold DNA polymerase (5 units/μl) 1 μl VA1F (25 pmoles/μl) 1 μl VA1Xr (25 pmoles/μl) 1 μl VAMSFb (25 pmoles/μl) 1 μl VAMSFc (25 pmoles/μl) 1 μl VAMSFd (25 pmoles/μl) 1 μl VAMSfe (25 pmoles/μl) 1 μl M1R (25 pmoles/μl) 26.3 μl water 50 μl total

[0249] The following PCR program can be used:

[0250] 9 min pre-incubation at 94° C.

[0251] 40 cycles of 1 min 94° C., 1 min 50° C., and 1 min 72° C.

[0252] final extension: 5 min at 72° C.

[0253] The mixture of probes used for detection of the vacA s-region is shown in table 6. TABLE 6 Probes for detection of vacA s-region amplimers by DEIA probe sequence target HpdiaS1 DIG-CATGCYGCCTTCTTTACAACCGT s1 HpdiaS2 DIG-CATGCCGCCTTTTTCACRACCGT s1 HpdiaS3 DIG-CATGCCGCTCTTTTTACAACCGT s1 HpdiaS4 DIG-CATGCCGCCTTTTTTACAACCGT s1 HpdiaS5 DIG-AGTCGCGCYTTTTTYACAAGCGT s2

[0254] Practically, microtiterplate wells were precoated with streptavidin. Ten μl of PCR product was mixed with amplimer diltion buffer (1×SSC, 0.1% Tween-20, and 0.004% phenol red). After incubation at 42° C. for 30 minutes, the wells were washed 3 times with 400 μl washing solution (1×SSC, 0.1% Tween-20). The captured PCR products were denatured by addition of 100 μl of 0.1M NaOH into the well and incubated for 5 minutes at room temp. The fluid, containing the unbiotinylated eluted strand was removed. 100 μl hybridization solution containing 1×SSC, 0.1% Tween-20, 0.004% phenol red and 1 pmole of digoxigenin (DIG)-labelled oligonucleotide probe(s) were added to the well and incubated for 45 minutes in a waterbath at 42° C. After washing the wells 3 times with washing solution, 100 μl of 75 mU/ml anti-digoxdgenin-peroxidase conjugate (Boehringer Mannheim) was added and incubated for 15 minutes in a waterbath at 42° C. The unbound conjugate was removed by washing the wells 5 times with washing solution. 100 μl of substrate solution containing tetramethylbenzidine (TMB) was added to the wells. After incubation for 15 minutes at room temperature the colour reaction was stopped by addition of 100 μl 0.5M sulphuric acid. The optical density of the wells was read at 450 nm in a microtiter plate reader.

[0255] For interpretation of the results, optical densities of the samples were compared with negative controls and borderline positive controls. Table 7 shows the result of a DEIA analysis of 6 samples. Sample 1 and 5 yield an optical density that is higher than that of the borderline positive control; these samples are therefore considered positive. The optical density of the other samples is lower than the borderline positive control; they are considered negative. TABLE 7 Results of a DEIA test Sample OD conclusion negative positive control 1.178 borderline pos. control 0.214 negative control 0.102 sample 1 >4.0 positive sample 2 0.086 negative sample 3 0.098 negative sample 4 0.108 negative sample 5 2.146 positive sample 6 0.096 negative

[0256] TABLE 1 Nucleotide sequence of the primers: cagF SEQ ID NO 12 5′-TTGGACCAACAACCACAAACCGAAG-3′ cagR SEQ ID NO 13 5′-CTTCGCTTAATTGCGAGATTCC-3′ VA1-F Atherton et al., 1995 5′-ATGGAAATACAACAAACACAC-3′ VA1XR SEQ ID NO 14 5′-CCTGARACCGTTCCTACAGC-3′ M1F SEQ ID NO 15 5′-GTGGATGCYCATACRGCTWA-3′ M1R SEQ ID No 16 5′-RTGAGCTTGTTGATATTGAC-3′ HPMGF SEQ ID No 17 5′-CACAGCCACTTTCAATAACGA-3′ HPMGR SEQ ID No 18 5′-CGTCAAAATAATTCCAAGGG-3′ cagSF SEQ ID No 19 5′-CAACAACGACAAACGGAAG-3′ cagSR SEQ ID No 20 5′-GATTGGTTTTTTGATCAGGATC-3′ cagFN1 SEQ ID No 21 5′-GATAAGAAYGATAGGGATAA-3′ cagRN1 SEQ ID No 22 5′-AATACTGATTCTTTTTGG-3 VAMSFb SEQ ID No 23 GTGGATGCCCATACGGCTAA VAMSFc SEQ ID No 24 GTGGATGCTCATACAGCTWA VAMSFd SEQ ID No 25 GTGGATGCCCATACGATCAA VAMSFe SEQ ID No 26 GCGAOCGCTCATACGGTCAA

[0257] TABLE 2 Nucleotide sequence of the probes: cngApro SEQ ID NO 1 GTTGATAACGCTGTCGCTTC (pos. 94-113) P151 SEQ ID NO 2 GGAGCRTTRGTCAGCATCAC (pos. 61-80 of vacA ORF of strain 60190 (Genbank Acc. U05676)) P22S1a SEQ ID NO 3 GCTTTAGTAGGAGCRTTRGTC (pos. 52-72 of vacA ORf of strain 60190 (Genbank Acc. U05676)) P1S1b SEQ ID NO 4 GGAGCGTTGATTAGYKCCAT (pos. 61-80) P2S1b SEQ ID NO 5 GTTTTAGCAGGAGCGTTGA (pos. 52-72) P1S2(VAS2) SEQ ID NO 6 GCTAAYACGCCAAAYGATCG (pos. 88-107 of vacA ORF of strain Tx30a (Genbank Acc. U29401)) P2S2 SEQ ID NO 7 GATCCCATACACAGCGAGAG (pos. 103-122 of vacA ORF of strain Tx30a (Genbank Acc. U29401)) P1M1 SEQ ID NO 8 TTGATACGGGTAATGGTGG (pos. 1526-1544 of vacA ORF of strain 60190 (Genbank Acc. U05676)) P2M1 SEQ ID NO 9 GGGTAATGGTGGTTTCAACA (pos. 1533-1552 of vacA ORF of strain 60190 (Genbank Acc. U05676)) P1M2 SEQ ID NO 10 ACGAATTTAAGAGTGAATGGC (pos. 1522-1542 of vacA ORF of strain Tx30a (Genbank Acc. U29401)) P2M2 SEQ ID NO 11 AGAGCGATAACGGGCTAAACA (pos. 1577-1597 of vacA ORF of strain Tx30a (Genbank Acc. U29401)) cagprobe3 SEQ ID NO 27 GGATTTTTGATCGCTTTATT (pos. 219-227) P3S1 SEQ ID NO 28 GGGYTATTGGTYAGCATCAC (pos. 26-45) P4S1 SEQ ID NO 29 GCTTTAGTRGGGYTATTGGT (pos. 17-36) P1M1new SEQ ID NO 30 TTGATACKGGTAATGGTGG P2M1new SEQ ID NO 31 KGGTAATGGTGGTTTCAACA P1M2new SEQ ID NO 32 KGGTAATGGTGGTTTCAACA P2M2new SEQ ID NO 33 AGAGCGMAAYGGKCTAAACA P1M3 SEQ ID NO 34 AGGGTAGAAATGGTATCGACA HpdiaS1 SEQ ID NO 35 DIG-CATGCYGCCTTCTTTACAACCGT HpdiaS2 SEQ ID NO 36 DIG-CATGCCGCCTTTTTCACRACCGT HpdiaS3 SEQ ID NO 37 DIG-CATGCCGCTCTTTTTACAACCGT HpdiaS4 SEQ ID NO 38 DIG-CATGCCGCCTTTTTTACAACCGT HpdiaS5 SEQ ID NO 39 DIG-AGTCGCGCYTTTTTYACAACCGT

References

[0258] Akopyanz, N., Bukanov, N. O., Westblom, T. R, Kresovich, S., and Berg, D. E. (1992) Nuc. Acids Res, 20, 5137-5142.

[0259] Covacci A., Censini, S., Bugnoli, M., Petracca, R., Burroni, D., Macchia, G., Massone, A., Papini, E., Xiang, Z., Figura, N., and Rappuoli, R. (1993) Proc. Natl. Acad. Sci. U.S.A. 90, 5791-5795.

[0260] Cover, T. L., and Blaser, M. J. (1995) Adv. Intern. Med. 41, in press.

[0261] Cover, T. L., and Blaser, M. J. (1992) J. Biol. Chem. 267, 10570-10575.

[0262] Foxall, P. A., Hu, L-T., and Mobley, H. L. T. (1992) J. Clin. Microbiol. 30, 739-741.

[0263] Hentschel, E., Brandstatter, G., Dragosics, B., Hirschl, A. M., Nemec, H. Schutze, K., Taufer, M., and Wurzer, H. (1993) N. Engl. J. Med. 328, 308-312.

[0264] Leunk, R. D., Johnson, P. T., David, B. C., Kraft, W. G., and Morgan, D. R. (1988) J. Med. Microbiol. 26, 93-99.

[0265] Parsonnet, J., Friedman, G. D., Vandersteen, D. P., Chang, Y., Vogelman, J. H., Orentreich, N., and Sibley, R. K. (1991) N. Engl. J. Med. 325, 1127-1131.

[0266] Phadnis, S. H., Ilver, D., Janzon, L., Normark, S., and Westblom, T. U. (1994) Infect. Immun. 62, 1557-1565.

[0267] Schmitt, W., and Haas, R. (1994) Mol. Microbiol. 12, 307-319.

[0268] Telford, J. L., Ghiara, P., Dell'Orco, M., Comanducci, M., Burroni, D., Bugnoli, M., Tecce, M. F., Censini, S., Covacci A., Xiang, Z., Papini, E., Montecucco, C., Parente, L., and Rappuoli, R. (1994) J. Exp. Med. 179, 1653-1658.

[0269] Tummuru, M. K. R., Cover, T. L., and Blaser, M. J. (1993) Infect. Immun. 61, 1799-1809.

1 280 20 base pairs nucleic acid single linear DNA (genomic) NO NO 1 GTTGATAACG CTGTCGCTTC 20 20 base pairs nucleic acid single linear DNA (genomic) NO NO 2 GGAGCRTTRG TCAGCATCAC 20 21 base pairs nucleic acid single linear DNA (genomic) NO NO 3 GCTTTAGTAG GAGCRTTRGT C 21 20 base pairs nucleic acid single linear DNA (genomic) NO NO 4 GGAGCGTTGA TTAGYKCCAT 20 19 base pairs nucleic acid single linear DNA (genomic) NO NO 5 GTTTTAGCAG GAGCGTTGA 19 20 base pairs nucleic acid single linear DNA (genomic) NO NO 6 GCTAAYACGC CAAAYGATCC 20 20 base pairs nucleic acid single linear DNA (genomic) NO NO 7 GATCCCATAC ACAGCGAGAG 20 19 base pairs nucleic acid single linear DNA (genomic) NO NO 8 TTGATACGGG TAATGGTGG 19 20 base pairs nucleic acid single linear DNA (genomic) NO NO 9 GGGTAATGGT GGTTTCAACA 20 21 base pairs nucleic acid single linear DNA (genomic) NO NO 10 ACGAATTTAA GAGTGAATGG C 21 21 base pairs nucleic acid single linear DNA (genomic) NO NO 11 AGAGCGATAA CGGGCTAAAC A 21 24 base pairs nucleic acid single linear DNA (genomic) NO NO 12 TTGACCAACA ACCACAAACC GAAG 24 22 base pairs nucleic acid single linear DNA (genomic) NO NO 13 CTTCCCTTAA TTGCGAGATT CC 22 20 base pairs nucleic acid single linear DNA (genomic) NO NO 14 CCTGARACCG TTCCTACAGC 20 20 base pairs nucleic acid single linear DNA (genomic) NO NO 15 GTGGATGCYC ATACRGCTWA 20 20 base pairs nucleic acid single linear DNA (genomic) NO NO 16 RTGAGCTTGT TGATATTGAC 20 21 base pairs nucleic acid single linear DNA (genomic) NO NO 17 CACAGCCACT TTCAATAACG A 21 20 base pairs nucleic acid single linear DNA (genomic) NO NO 18 CGTCAAAATA ATTCCAAGGG 20 19 base pairs nucleic acid single linear DNA (genomic) NO NO 19 CAACAACCAC AAACCGAAG 19 21 base pairs nucleic acid single linear DNA (genomic) NO NO 20 GATTGGTTTT TGATCAGGAT C 21 20 base pairs nucleic acid single linear DNA (genomic) NO NO 21 GATAAGAAYG ATAGGGATAA 20 18 base pairs nucleic acid single linear DNA (genomic) NO NO 22 AATACTGATT CTTTTTGG 18 20 base pairs nucleic acid single linear DNA (genomic) NO NO 23 GTGGATGCCC ATACGGCTAA 20 20 base pairs nucleic acid single linear DNA (genomic) NO NO 24 GTGGATGCTC ATACAGCTWA 20 20 base pairs nucleic acid single linear DNA (genomic) NO NO 25 GTGGATGCCC ATACGATCAA 20 20 base pairs nucleic acid single linear DNA (genomic) NO NO 26 GCGAGCGCTC ATACGGTCAA 20 20 base pairs nucleic acid single linear DNA (genomic) NO NO 27 GGATTTTTGA TCGCTTTATT 20 20 base pairs nucleic acid single linear DNA (genomic) NO NO 28 GGGYTATTGG TYAGCATCAC 20 20 base pairs nucleic acid single linear DNA (genomic) NO NO 29 GCTTTAGTRG GGYTATTGGT 20 19 base pairs nucleic acid single linear DNA (genomic) NO NO 30 TTGATACKGG TAATGGTGG 19 20 base pairs nucleic acid single linear DNA (genomic) NO NO 31 KGGTAATGGT GGTTTCAACA 20 20 base pairs nucleic acid single linear DNA (genomic) NO NO 32 KGGTAATGGT GGTTTCAACA 20 21 base pairs nucleic acid single linear DNA (genomic) NO NO 33 AGAGCGATAA YGGKCTAAAC A 21 21 base pairs nucleic acid single linear DNA (genomic) NO NO 34 AGGGTAGAAA TGGTATCGAC A 21 23 base pairs nucleic acid single linear DNA (genomic) NO NO 35 CATGCYGCCT TCTTTACAAC CGT 23 23 base pairs nucleic acid single linear DNA (genomic) NO NO 36 CATGCCGCCT TTTTCACRAC CGT 23 23 base pairs nucleic acid single linear DNA (genomic) NO NO 37 CATGCCGCTC TTTTTACAAC CGT 23 23 base pairs nucleic acid single linear DNA (genomic) NO NO 38 CATGCCGCCT TTTTTACAAC CGT 23 23 base pairs nucleic acid single linear DNA (genomic) NO NO 39 AGTCGCGCYT TTTTYACAAC CGT 23 184 base pairs nucleic acid single linear DNA (genomic) NO NO 40 CCCTCTGGTT TCTCTCGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGGTATCG CTACAGGCAC 120 CGCTGTAGGA ACGGTCTCAG GGCTTCTTAG TTGGGGACTA AAACAAGCCG AAGAAGCCAA 180 TAAA 184 199 base pairs nucleic acid single linear DNA (genomic) NO NO 41 TCGCCCTCTN GTTTCTCTCG CTTTAGTAGN AGCATTNGTC AGCATCACAC CGCAACANAG 60 TCATGCCGCC TTTTTCACAA CCGTNATCAT TCCAGCCATT GTTGGGGGTA TNGCTACAGG 120 CACCGCTGTA GGAACGGTCT CAGGGCTTCT TAGTTGGGGA CTAAAACAAG CCGAAGAAGC 180 CAATAAAACC CCAGATAAA 199 227 base pairs nucleic acid single linear DNA (genomic) NO NO 42 GAATCGCCCT YTGGTTTCTC TTGCTTTAGT AGGAGCATTG RTTAGYRYCA YACCGCAACA 60 AAGTCATGCC GCCTTTTTYA CRACCGTGAT CATTCCAGCC ATTGTTGGRG GTATCGCTAC 120 AGGCACTGCT GTAGGAACGG TCTCAGGGCT TCTTAGTTGG GGRCTCAAAC AAGCCGAAGA 180 AGCSAATAAA ACCCCRGATA AACCCGATAA AGTTTGGCGC ATTCAAG 227 176 base pairs nucleic acid single linear DNA (genomic) NO NO 43 GCCCTTTAGT TTCTCTCGCT TTAGTGGGGT TATTGGTCAG CATCACACCA CAAAAAAGTC 60 ATGCCGCCTT TTTCACAACC GTGATCATTC CAGCCATTGT TGGAGGTATC GCTACAGGTG 120 CTGCTGTAGG AACGGTCTCA GGGCTTCTTG GTTGGGGGCT CAAACAAGCC GAAGAA 176 185 base pairs nucleic acid single linear DNA (genomic) NO NO 44 GCCCTCTGGT TTCTCTCGCT TTAGTAGGAG CATTGGTCAG CATCACACCG CAACAAAGTC 60 ATGCCGCCTT TTTCACAACC GTGATCATTC CAGCCATTGT TGGAGGTATC GCTACAGGCG 120 CTGCTGTAGG AACGGTCTCA GGGCTTCTTA GCTGGGGGCT CAAACAAGCC GAAGAAGCCA 180 ATAAA 185 204 base pairs nucleic acid single linear DNA (genomic) NO NO 45 AATCGCCCTC TGGTTTCTCT CGCTTTAGTA GGAGCATTGG TCAGCATCAC ACCGCAACAA 60 AGTCATGCCG CCTTTTTTAC AACCGTGATC ATTCCAGCCA TTGTTGGAGG TATCGCTACA 120 GGCGCTGCTG TAGGAACGGT CTCAGGGCTT CTTAGCTGGG GGCTCAAACA AGCCGAACAA 180 GCCAATAAAG CCCCGGACAA ACCC 204 207 base pairs nucleic acid single linear DNA (genomic) NO NO 46 GAATCGCCCT TTAGTTTCTC TTGCTTTAGT AGGAGCATTG GTCAGCATCA CACCGCAACA 60 AAGTCATGCC GCCTTTTTCA CAACCGTGAT CATTCCAGCC ATTGTTGGGG GTATCGCTAC 120 AGGCGCTGCT GTAGGAACGG TTTCAGGGCT TCTTGGCTGG GGGCTAAAAC AAGCCGAAGA 180 AGCCAATAAA ACCCCAGATA AACCCGA 207 207 base pairs nucleic acid single linear DNA (genomic) NO NO 47 CAATCGCCCT CTAGTTTCTC TCGCTTTAGT AGGAGCATTG GTCAGCATCA CACCGCAACA 60 AAGTCATGCC GCCTTTTTCA CAACCGTGAT CATTCCAGCC ATTGTGGGGG GTATCGCTAC 120 AGGCGCTGCT GTAGGAACGG TCTCAGGGCT TCTTAGCTGG GGGCTCAAAC AAGCCGAAGA 180 AGCCAATAAA ACCCCGGACA AACCCGA 207 207 base pairs nucleic acid single linear DNA (genomic) NO NO 48 CAATCGCCCT CTGGTTTCTC TTGCTTTAGT AGGAGCGTTA GTCAGCATCA CACCGCAACA 60 AAGTCATGCC GCCTTTTTCA CAACCGTGAT CATTCCAGCC ATTGTTGGGG GGATCGCTAC 120 AGGCGCTGCT GTAGGAACGG TCTCAGGGCT TCTTAGCTGG GGGCTCAAAC AAGCCGAAGA 180 AGCCAATAAA ACCCCAGATA AACCCGA 207 176 base pairs nucleic acid single linear DNA (genomic) NO NO 49 CCCTTTAGTT TCTCTTGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGGTATCG CTACAGGCAC 120 CGCTGTAGGA ACGGTTTCAG GGCTTCTTAG CTGGGGGCTC AAACAAGCCG AAGAAG 176 187 base pairs nucleic acid single linear DNA (genomic) NO NO 50 CCCTTTAGTT TCTCCTGTTT TAGCAGGAGC GTTGATTAGC TCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGGTATCG CTACAGGCAC 120 TGCTGTAGGA ACGGTCTCAG GGCTTCTTAG CTGGGGGCTC AAACAAGCYG AASAAGCSAA 180 TAAAGCC 187 193 base pairs nucleic acid single linear DNA (genomic) NO NO 51 CTTGAGTTTC TCTTGTTTTA GCAGGAGCGT TGATTAGCGC CATACCGCAA CAAAGTCATG 60 CCGCCTTTTT CACGACCGTG ATCATTCCAG CCATTGTTGG GGGTATCGCT ACAGGCACCG 120 CTGTAGGAAC GGTTTCAGGG CTTCTTAGCT GGGGGCTCAA ACAAGCCGAA GAAGCCAATA 180 AAACCCCAGA TAA 193 196 base pairs nucleic acid single linear DNA (genomic) NO NO 52 GCCCTTTAGT TTCTCTCGTT TTAGCAGGAG CGTTGATTAG CTCCATACCG CAACAAAGTC 60 ATGCCGCCTT TTTCACAACC GTGATCATTC CAGCCATTGT TGGGGGTATC GCTACAGGCA 120 CCGCTGTAGG AACGGTTTCA GGGCTTCTTA GCTGGGGGCT CAAACAAGCC GAACAAGCCA 180 ATAAAGCCCC GGACAA 196 131 base pairs nucleic acid single linear DNA (genomic) NO NO 53 AATCGCCCTT TAGTTTCTCT TGTTTTAGCA GGAGCGTTGA TTAGCGCCAT ACCGCAACAA 60 AGTCATGCCG CCTTTTTCAC GACCGTNATC ATTCCAGCCA TTGTTNNNNG TATCGCTACA 120 GGCACCGCTG T 131 185 base pairs nucleic acid single linear DNA (genomic) NO NO 54 CTACGCCCTT TAGTTTCTCT CGTTTTAGCA GGAGCGTTGA TTAGCGCCAT ACCGCAACAA 60 AGTCATGCCG CCTTTTTCAC AACCGTGATC ATTCCAGCCA TTGTTGGGGG CATCGCTTCA 120 GGCGCTGCTG TAGGAACGGT CTCAGGGCTT CTTAGTTGGG GACTCAAACA AGCCGAAGAA 180 GCGAA 185 201 base pairs nucleic acid single linear DNA (genomic) NO NO 55 AATCGCCCTT TAGTTTCTCT CGTTTTAGCA GGAGCGTTGA TTAGCGCCAT ACCGCAAGAG 60 AGTCATGCCG CCTTTTTCAC AACCGTNATC ATTCCAGCCA TTGTTGGGGG TATCGCTACA 120 GGCACCGCTG TAGGAACGGT CTNAGGGCTT YTTAGTTGGG GACTNWAACA AGCCGAAGAA 180 GCCAATAAAA CCCCGGATAA A 201 187 base pairs nucleic acid single linear DNA (genomic) NO NO 56 CCCTTTAGTT TCTCTCGTTT TAGCANNAGC GTTGATTAGC RCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTTACAACCG TNATCATTCC AGCCATTGTT GGGGGTATCG CTACAGGYGC 120 TGCTGTAGGA ACGGTCTCAG GGCTTCTTAG CTGGGGGCTC AAACAAGCCG AACAAGCCAA 180 TAAAGCC 187 187 base pairs nucleic acid single linear DNA (genomic) NO NO 57 CCCTTTGGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC TCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGGTATCG CTACAGGCGC 120 TGCTGTAGGA ACGGTCTCAG GGCTTCTTAG CTGGGGGCTC AAACAAGCCG AAGAAGCGAA 180 TAAAACC 187 187 base pairs nucleic acid single linear DNA (genomic) NO NO 58 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC TCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGGTATCG CTACAGGCAC 120 CGCTGTAGGA ACGGTCTCAG GGCTTCTTAG CTGGGGACTC AAACAAGCCG AAGAAGCGAA 180 TAAAACC 187 185 base pairs nucleic acid single linear DNA (genomic) NO NO 59 CCCTTTAGTT TCTCTCGTTT TAGCAGNAGC GTTNATTAGT GCCATACCGC AANANAGTCA 60 TGCCGCCTTT TTCACNACCG TNATCATTCC AGCCATTGTT GGGGGTATCG CCACAGGCAC 120 CGCTGTAGGA ACGGTCTCAG GGCTTCTTAG TTGGGGACTN AAACAAGCCG AAGAAGCGAA 180 TAAAA 185 199 base pairs nucleic acid single linear DNA (genomic) NO NO 60 CAGGAGCGTT GATTAGTGCC ATACCGCAAG AGAGTCATGC CGCCTTTTTC ACGACCGTGA 60 TCATTCCAGC CATTGTTGGG GGTATCGCCA CAGGCACCGC TGTAGGAACG GTCTCAGGGC 120 TTCTTAGTTG GGGACTCAAA CAAGCCGAAG AAGCGAATAA AACCCCAGTA TAAACCCGAT 180 AAAGTTTGGC GCATTCAAG 199 188 base pairs nucleic acid single linear DNA (genomic) NO NO 61 TTTCTCTCGT TTTAGCAGGA GCGTTGATTA GCTCCATACC GCAAGAGAGT CATGCCGCCT 60 TTTTCACAAC CGTGATCATT CCAGCCATTG TTGGGGGTAT CGCTACAGGC GCTGCTGTAG 120 GAACGGTCTC AGGGCTTCTT AGCTGGGGGC TCAAACAAGC CGAAGAAGCG AATAAAACCC 180 CAGATAAA 188 206 base pairs nucleic acid single linear DNA (genomic) NO NO 62 GAATCGCCCT TTAGTTTCTC TCGTTTTAGC AGGAGCGTTG ATTAGTGCCA TACCGCAAGA 60 GAGTCATGCC GCCTTTTTCA CGACCGTGAT CATTCCAGCC ATTGTTGGGG GTATCGCCAC 120 AGGCACCGCT GTAGGAACGG TCTCAGGGCT TCTTAGTTGG GGACTCAAAC AAGCCGAAGA 180 AGCGAATAAA ACCCCAGTAT AAACCC 206 206 base pairs nucleic acid single linear DNA (genomic) NO NO 63 AATCGCCCTT TGGTTTCTCT CGTTTTAGCA GGAGCGTTGA TTAGCTCCAT ACCGCAAGAG 60 AGTCATGCCG CCTTTTTCAC AACCGTGMTC ATTCCAGCCA TTGTTGGGGG TATGGCTACA 120 GGCGCTGCTG TAGGAACGGT CTNAGGGCTT YTTAGCTGGG GGCTNWAACA AGCCGAAGAA 180 GCGAATAAAA CCCCAGATAA ACCCGA 206 201 base pairs nucleic acid single linear DNA (genomic) NO NO 64 AATCGCCCTA TTATTTCTCT CGTTTTAGCA GGAGCGTTGA TTAGCTCCAT ACCGCAAGAA 60 AGTCATGCCG CCTTTTTCAC GACCGTGATC ATTCCAGCCA TTGTTGGGGG TATCGCCACA 120 GGCGCTGCTG TAGGAACGGT CTCAGGGCTT CTTAGCTGGG GGCTCAAACA AGCCGAACAA 180 GCCAATAAAG CCCCGGACAA A 201 197 base pairs nucleic acid single linear DNA (genomic) NO NO 65 GCCCTTTAGT TTCTCTTGTT TTAGCAGGAG CGTTGATTAG TGCCATACCG CAAGAGAGCT 60 ATGCCGCCTT TTTCACAACC GTGATCATTC CAGCCATTGT TGGGGGTATC GCTACAGGCA 120 CCGCTGTAGG AACGGTCTCA GGGCTTCTTA GTTGGGGACT CAAACAAGCC GAAGAAGCGA 180 ATAAAACCCC AGATAAA 197 201 base pairs nucleic acid single linear DNA (genomic) NO NO 66 AATCGCCCTT TAGTTTCTCT CGTTTTAGCA GGAGCGTTTA TTAGCGCCAT ACCGCAAGAG 60 AGTCATGCCG CCTTTTTCAC GACCGTGATC ATTCCAGCCA TTGTTGGGGG TATCGCCACA 120 GGCACCGCTG TAGGAACGGT CTCAGGGCTT CTTAGTTGGG GACTCAAACA AGCCGAAGAA 180 GCGAATAAAA CCCCAGATAA A 201 195 base pairs nucleic acid single linear DNA (genomic) NO NO 67 GTGGGGGTGT TAATGGGCAC CGAACTAGGG GCTAATACGC CAAACGATCC CATACACAGC 60 GAGAGTCGCG CCTTTTTCAC AACCGTGATC ATTCCAGCCA TTGTTGGGGG TATCGCCACA 120 GGCACTGCTG TAGGAACGGT CTCAGGGCTT CTTAGTTGGG GACTCAAACA AGCCGAAGAA 180 GCGAATAAAA CCCCA 195 196 base pairs nucleic acid single linear DNA (genomic) NO NO 68 AGTGGGGGTG TTAATGGGCA CTGAACTAGG GGCTAACACG CCAAACGATC CCATACACAG 60 CGAGAGTCGC GCCTTTTTTA CAACCGTGAT CATTCCAGCC ATTGTTGGGG GTATCGCTAC 120 AGGCGCTGCT GTAGGAACGG TTTCAGGGCT TCTTAGCTGG GGGCTCAAAC AAGCCGAACA 180 AGCCAATAAA GCCCCG 196 196 base pairs nucleic acid single linear DNA (genomic) NO NO 69 TAGTGGGGGC ATTAATGAGT ACCGAACTAG GGGCTAACAC GCCAAATGAT CCCATACACA 60 GCGAGAGTCG CGCCTTTTTT ACAACCGTGA TCATTCCAGC CATTGTTGGG GGTATCGCTA 120 CAGGCGCTGC TGTAGGAACG GTCTCAGGGC TTCTTAGTCG GGGGCTCAAA CAAGCCGAAC 180 AAGCCAATAA AGCCCC 196 232 base pairs nucleic acid single linear DNA (genomic) NO NO 70 CAATCGCCCT ATTATCTCTC TCGCTCTAGT GGGGGTGTTA ATGGGTACCG AACTAGGGGC 60 TAACACGCCA AACGATCCCA TACACAGCGA GAGTCGCGCC TTTTTTACAA CCGTGATCAT 120 TCCAGCCATT GTTGGGGGTA TCGCTACAGG CGCTGCTGTA GGAACGGTTT CAGGGCTTCT 180 TAGCTGGGGG CTCAAACAAG CCGAACAAGC CAATAAAGCC CCGGACAAAC CC 232 228 base pairs nucleic acid single linear DNA (genomic) NO NO 71 AATCGCCCTA TTATCTCTCT CGCTCTAGTG GGGGTGTTAA TGGGTACCGA ACTAGGGGCT 60 AACACGCCAA ACGATCCCAT ACACAGCGAG AGTCGCGCCT TTTTCACAAC CGTGATCATT 120 CCAGCCATTG TTGGAGGTAT CGCTACAGGT GCTGCTGTAG GAACGGTCTC AGGGCTTCTT 180 AGCTGGGGGC TCAAACAAGC CGAACAAGCC AATAAAGCCC CGGACAAA 228 228 base pairs nucleic acid single linear DNA (genomic) NO NO 72 AATCGCCCTA TTATCTCTCT CGCTTTAGTG GGGGTRTTAA TGGGCACCGA ACTAGGGGCT 60 AACACGCCAA ACGATCCCAT ACACAGCGAG AGTCGCGCCT TTTTCACAAC CGTGATCATT 120 CCAGCCATTG TTGGGGGTAT CGCTACAGGC GCTGCTGTAG GAACGGTCTC AGGGCTTCTT 180 AGCTGGGGGC TCAAACAAGC CGAACAAGCC AATAAAGCCC CGGATAAA 228 233 base pairs nucleic acid single linear DNA (genomic) NO NO 73 AATCGCCCTA TTATCTCTCT CGCTTTAGAG GGGGTGTTAA TAGGCACCGA ACTAGGGGCT 60 AACACGCCAA ATGATCCCAT ACACAGCGAG AGTCGCGCCT TTTTTACAAC CGTTATTATT 120 CCAGCCATTG TTGGGGGTAT CGCTACAGGC GCTGCTGTAG GAACGGTCTC AGGGCTTCTT 180 AGCTGGGGGC TCAAACAAGC CGAACAAGCC AATAAAGCCC CGGATAAACC CGA 233 300 base pairs nucleic acid single linear DNA (genomic) NO NO 74 TTTAAAGGTG GATGCTCATA CAGCTAATTT TAAAGGTATT GATACGGGTA ATGGTGGTTT 60 CAACACCTTA GATTTTAGTG GTGTTACAGG TAAGGTCAAT ATCAACAAGC TCATCACAGC 120 TTCCACTAAT GTGGCCGTTA AAAACTTCAA CATTAATGAA TTGATTGTTA AAACCAATGG 180 TGTGAGTGTG GGGGAATACA CTCATTTTAG CGAAGATATA GGCAGTCAAT CGCGCATCAA 240 TACCGTGCGT TTGGAAACTG GCACTAGGTC AATCTTTTCT GGGGGTGTTA AATTTAAAGG 300 300 base pairs nucleic acid single linear DNA (genomic) NO NO 75 TTTAAAGGTG GATGCTCATA CAGCTAATTT TAAAGGTATT GATACGGGTA ATGGTGGTTT 60 CAACACCTTA GATTTTAGTG GTGTTACAGG TAAGGTCAAT ATCAACAAGC TCATTACGGC 120 TTCCACTAAT GTGGCCGTTA AAAACTTCAA CATTAATGAA TTGTTGGTTA AGACCAATGG 180 GGTGAGTGTG GGGGAATACA CTCATTTTAG CGAAGATATA GGCAGTCAAT CGCGCATCAA 240 TACCGTGCGT TTGGAAACTG GCACTAGGTC AATCTTTTCT GGGGGTGTCA AATTTAAAGG 300 300 base pairs nucleic acid single linear DNA (genomic) NO NO 76 TTTAAAAGTG GATGCTCATA CAGCTAATTT TAAAGGTATT GATACGGGTA ATGGTGGTTT 60 CAACACCTTG GATTTTAGTG GCGTTACAGA CAAAGTCAAT ATCAACAAGC TCATCACAGC 120 TTCCACTAAT GTGGCCATTA AAAACTTCAA CATTAATGAA TTGTTGGTTA AGACCAATGG 180 GGTGAGTGTG GGGGAATACA CTCATTTTAG CGAAGATATA GGCAGTCAAT CGCGCATCAA 240 CACCGTGCGT TTAGAAACTG GCACTAGGTC AATCTTTTCT GGGGGTGTCA AATTTAAAAG 300 300 base pairs nucleic acid single linear DNA (genomic) NO NO 77 TTTAAAGGTG GATGCTCATA CAGCTAATTT TAAAGGTATT GATACGGGTA ATGGTGGTTT 60 CAACACCTTA GATTTTAGTG GTGTTACAGG TAAGGTCAAT ATCAACAAGC TCATCACAGC 120 TTCCACTAAT GTGGCCGCTA AAAACTTCAA CATTAATGAA TTGATTGTTA AAACCAATGG 180 GGTGAGTGTG GGGGAATACA CTCATTTTAG CGAAGATATA GGCAGTCAAT CGCGCATCAA 240 TACCGTGCGT TTGGAAACTG GCACTAGGTC AATCTATTCT GGCGGTGTTA AATTTAAAGG 300 300 base pairs nucleic acid single linear DNA (genomic) NO NO 78 TTTAAAAGTG GATGCTCATA CAGCTAATTT TAAAGGTATT GATACGGGTA ATGGTGGTTT 60 CAACACCTTA GATTTTAGTG GTGTTACAGG TAAGGTCAAT ATCAACAAGC TCATCACAGC 120 TTCCACTAAT GTGGCCGCTA AAAACTTCAA CATTAATGAA TTGATTGTTA AAACCAATGG 180 GGTGAGTGTG GGGGAATACA CTCATTTTAG CGAAGATATA GGCAGTCAAT CGCGCATCAA 240 TACCGTGCGT TTGGAAACTG GCACTAGGTC AATCTATTCT GGCGGTGTTA AATTTAAAGG 300 300 base pairs nucleic acid single linear DNA (genomic) NO NO 79 TTTAAAAGTG GATGCTCATA CAGCTAATTT TAAAGGTATT GATACTGGTA ATGGTGGTTT 60 CAACACCTTA GATTTTAGTG GTGTTACAAA CAAAGTCAAT ATCAACAAGC TCATTACAGC 120 TTCCACTAAT GTGGCCGTTA AAAACTTCAA CATTAATGAA TTGTTGGTTA AGATTAATGG 180 GGTGAGTGTG GGGGAATACA CTTATTTTAG CGAAGATATA GGCAGTCAAT CGCGCATCAA 240 CACCGTGCGT TTGGAAACTG GCACTAGGTC AATCTATTCT GGCGGTGTTA AATTTAAAGG 300 300 base pairs nucleic acid single linear DNA (genomic) NO NO 80 TTTAAAGGTG GATGCTCATA CAGCTAATTT TAAAGGTATT GATACGGGTA ATGGTGGTTT 60 CAACACCTTA GATTTTAGTG GTGTTACAGG TAAGGTCAAT ATCAACAAGC TCATCACGGC 120 TTCCACTAAT GTGGCCGTTA AAAACAACAA CATTAATGAA TTGGTGGTTA AAACCAATGG 180 GATAAGTGTG GGGGAATACA CTCATTTTAG CGAAGATATA GGCAGTCAAT CGCGCATCAA 240 TACCGTGCGT TTGGAAACAG GCACTAGGTC AATCTTTTCT GGGGGTGTCA AATTTAAAAG 300 300 base pairs nucleic acid single linear DNA (genomic) NO NO 81 TTTAAAAGTG GATGCTCATA CAGCTAATTT TAAAGGTATT GATACGGGTA ATGGTGGTTT 60 CAACACCTTA GATTTTAGTG GTGTTACAGG TAAGGTCAAT ATCAACAAGC TCATTACGGC 120 TTCCACTAAT GTAGCCGTTA AAAACTTCAA CATTAATGAA TTGTTGGTTA AGACCAATGG 180 GGTGAGTGTG GGGGAATACA CTCATTTTAG CGAAGATATA GGCAGTCAAT CGCGCATCAA 240 CACCGTGCGT TTGGAAACTG GCACTAGGTC AATCTTTTCT GGGGGTGTCA AATTTAAAAG 300 300 base pairs nucleic acid single linear DNA (genomic) NO NO 82 TTTAAAGGTG GATGCTCATA CAGCTAATTT TAAAGGTATT GATACGGGTA ATGGTGGTTT 60 CAACACCTTA GATTTTAGTG GTGTTACAGG TAAGGTCAAT ATCAACAAGC TCATCACAGC 120 TTCCACTAAT GTGGCCGTTA AAAACTTCAA CATTAATGAA TTGATTGTTA AAACCAATGG 180 GATAAGTGTG GGGGAATACA CTCATTTTAG CGAAGATATA GGAAGTCAAT CGCGCATCAA 240 TACCGTGCGT TTGGAAACTG GCACTAGATC AATCTTTTCT GGGGGTGTTA AATTTAAAGG 300 375 base pairs nucleic acid single linear DNA (genomic) NO NO 83 TTTAAGAGTG GACGCTCATA CAGCTTATTT TAATGGCAAT ATTTATCTGG GAAAATCCAC 60 GAATTTAAGA GTGAATGGCC ATAGCGCTCA TTTTAAAAAT ATTGATGCCA GCAAGAGCGA 120 TAACGGGCTA AACACTAGCG CTTTGGATTT CAGCGGCGTT ACAGACAAAG TCAATATCAA 180 CAAGCTCACT ACATCTGCCA CTAATGTGAA CGTTAAAAAC TTTGACGTTA AGGAATTGGT 240 GGTTACAACC CGTGTTCAGA GTTTTGGGCA ATACACTATT TTTGGCGAAA ATATAGGCGA 300 TAAGTCTCGC ATTGGTGTCG TGAGTTTGCA AACGGGATAT AGCCCGGCCT ATTCTGGGGG 360 CGTTACTTTT AAAAG 375 375 base pairs nucleic acid single linear DNA (genomic) NO NO 84 CTTAAGAGTG GATGCTCATA CAGCTTATTT TAATGGCAAT ATTTATTTGG GAAAATCCAC 60 GAATTTAAGA GTGAATGGCC ATAGCGCTCA TTTTAAAAAT ATTGATGCCA GTAAGAGCGA 120 TAACGGGCTA AACACTAGTG CTTTGGATTT TAGCGGCGTT ACAGATAAAG TCAATATCAA 180 CAAGCTCACT ACATCTGCCA CTAATGTGAA CGTTAAAAAC TTTGACATTA AGGAATTGGT 240 GGTTACAACC CGAGTTCAAA GTTTTGGGCA ATACACTATT TTTGGCGAAA ATATAGGCGA 300 TAAGTCTCGC ATTGGTGTCG TTAGTTTGCA AACGGGATAT AGCCCGGCCT ATTCTGGGGG 360 CGTTACTTTT AAAAG 375 374 base pairs nucleic acid single linear DNA (genomic) NO NO 85 TTTAAGAGTG GATGCTCATA CAGCTTATTT TAATGGCAAT ATTTATCTGG GAAAATCCAC 60 GAATTTAAGA GTGAATGGCC ATAGCGCTCA TTTTAAAAAT ATTGATGCCA GTAAGAGCGA 120 TAACGGGCTA AACACTACCA CTTTGGATTT CAGCGGCGTT ACAGATAAAG TCAATATCAA 180 CAAGCTCACT ACATCTGCCA CTAATGTGAA CATTAAAAAC TTTGACATTA AGGAATTAGT 240 GGTTACAACC CGAGTTCAGA GTTTTGGGCA ATACACTATT TTTGGCGAAA ATATAGGCGA 300 TAAGCTGCAC ATTGGTGTCG TGAGTTTGCA AACGGGATAT AGCCCAGCCT ATTCTGGGGG 360 GCTTACTTTT AAAG 374 375 base pairs nucleic acid single linear DNA (genomic) NO NO 86 TTTAAGAGTG GATGCTCATA CAGCTTATTT TAATGGCAAT ATTTATCTGG GAAAATCCAC 60 GAATTTAAGA GTGAATGGCC ATAGCGCTCA TTTTAAAAAT ATTGATGCCA GTAAGAGCGA 120 TAACGGGCTA AACACTAGCT CTTTGGATTT CAGTGGCGTT ACAGACAAAG TCAATATCAA 180 CAAGCTCACT ACATCTGCCA CTAATGTGAA CGTTAAAAAC TTTGACATTA AGGAATTGGT 240 GGTTACAACC CGCGTTCAGA GTTTTGGGCA ATACACTATT TTTGGCGAAA ATATAGGCGA 300 TAAGTCTCGC ATTGGTGTCG TTAGTTTGCA AACGGGATAT AGCCCGGCCT ATTCTGGGGG 360 CGTTACTTTT AAAAG 375 365 base pairs nucleic acid single linear DNA (genomic) NO NO 87 GATGCTCATA CAGCTTATTT TAATGGCAAT ATTTATCTGG GAAAATCCAC GAATTTAAGA 60 GTGAATGGCC ATAGCGCTCA TTTTAAAAAT ATTGATGCCA GTAAGAGCGA TAACGGGCTA 120 AACACTAGCG CTTTGGATTT YAGCGGCGTT ACAGAYAAAG TCAATATCAA CAAGCTCACT 180 ACATCTGCCA CTAATGTGAA CGTTAAAAAC TTTGACATTA AGGAATTAGT GGTTACAACC 240 CGAGTTCAAA GTTTTGGGCA ATACACTATT TTTGGCGAAA ATATAGGCGA TAAGTCTCGC 300 ATTGGTGTCG TTAGTTTGCA AACGGGATAT AGCCCGGCCT ATTCTGGGGG CGTTACTTTT 360 AAAAG 365 375 base pairs nucleic acid single linear DNA (genomic) NO NO 88 TTTAAGAGGG GATGCTCATA CAGCTTATTT TAATGGCAAT ATTTATTTGG GAAAATCCAC 60 GAATTTAAGA GTGAATGGCC ATAGCGCTCA TTTTAAAAAT ATTGATGCCA GTAAGAGCGA 120 TAACGGGCTA AACACTAGCG YTTTGGATTT TAGCGGCGTT ACAGAYAAAG TCAATATCAA 180 CAAGCTCACT ACATCTGCCA CTAATGTGAA CRTTAAAAAC TTTGAYATTA AGGAATTGGT 240 GGTTACAACC CGAGTTCAAA GTTTTGGGCA ATACACTATT TTTGGCGAAA ATATAGGCGA 300 TMAGTCTCGC ATTGGTGTCG TTAGTTTGCA AACGGGATAT AGCCCRGCCT ATTCTGGGGG 360 CGTTACTTTT AAAAG 375 375 base pairs nucleic acid single linear DNA (genomic) NO NO 89 TTTAAGCGTG GATGCTCATA CAGCTTATTT TAATGGTAAT ATTTATCTGG GAAAATCCAC 60 GAATTTAAGA GTGAATGGCC ATAGCGCTCA TTTTAAAAAT ATTGATGCCA CAAAGAGCGA 120 TAACGGGCTA AACACTAGCG CTTTGGATTT CAGCGGCGTT ACAGATAAAG TCAATATCAA 180 CAAGCTCACT ACATCTGCCA CTAACGTGAA CATTAAAAAC TTTGACATTA AGGAATTGGT 240 GGTTACAACC CGAGTTCAAA GTTTTGGGCA ATACACTATT TTTGGCGAAA ATATAGGCGA 300 TAAGTCTCGC ATTGGTGTCG TGAGTTTGCA AACGGGATAT AGCCCGGCCT ATTCTGGGGG 360 CGTTACTTTT AAAAG 375 375 base pairs nucleic acid single linear DNA (genomic) NO NO 90 TTTAAGAGTG GATGCTCATA CAGCTTATTT TAATGGCAAT ATTTATCTGG GAAAATCCAC 60 GAATTTAAAA GTGAATGGCC ATAGCGCTCA TTTTAAAAAT ATTGATGCCA GTAAGAGCGA 120 TAATGGTCTA AACACTAGTG CTTTGGATTT GAGCGGCGTT ACAGACAAAG TCAATATCAA 180 CAAGCTCACT ACAGCTGCCA CTAATGTGAA CATTAAAAAC TTTGACATTA AGGAATTAGT 240 GGTTACGACC CGTGTTCAGA GTTTTGGGCA ATACACTATT TTTGGCGAAA ATATAGGAGA 300 TCAATCGCGC ATTGGTGTCG TTAGTTTGCA AACTGGCTAT AGCCCGGCCT ATTCTGGGGG 360 CGTTACTTTT AAAAG 375 375 base pairs nucleic acid single linear DNA (genomic) NO NO 91 CTTAAGAGTG GATGCTCATA CAGCTTATTT TAATGGCAAT ATTTATCTGG GAAAATCCAC 60 GAATTTAAGA GTGAATGGCC ATAGCGCTCA TTTTAAAAAT ATTGATGCTA GTAAGAGCGA 120 TAACGGGCTA AACACTAGCG CTTTGGATTT TAGCGGCGTT ACAGACAAAG TCAATATCAA 180 CAAGCTCACT ACATCTGCCA CTAATGTGAA CATTAAAAAC TTTGACATTA AGGAATTGGT 240 GGTTACAACC CGAGTTCAAA GTTTTGGGCA ATACACTATT TTTGGCGAAA ATATAGGCGA 300 TAAGTCTCGC ATTGGTGTCG TGAGTTTGCA AACGGGATAT AGCCCGGCCT ATTCTGGGGG 360 CGTTACTTTT AAAAG 375 449 base pairs nucleic acid single linear DNA (genomic) NO NO 92 ATGACTAACG AAACCATTAA CCAACAACCA CAAAGCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT AGCTTTTCTT AAAGTTGATA ACGCTGTCGC TTCATACGAT 120 CCTGATCAAA AACCAATCGT TGATAAGAAC GATAGGGATA ATAGGCAAGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CAATAAGAGC AATGATTTAA TCAACAAAGA CGCTCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 93 ATGACTAACG AAACCATTAA CCAACAACCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT AGCTTTTCTT AAAGTTGATA ACGCTGTCGC TTCATACGAT 120 CCTGATCAAA AACCAATCGT TGATAAGAAC GATAGGGATA ACAGGCAAGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CAATAAGAGC AATGATTTAA TCAACAAAGA CGCTCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 94 ATGGCTAACG AAACTATTAA CCAACAACCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT AGCTTTTCTT AAAGTTGATA ACGCTGTCGC TTCATACGAT 120 CCTGATCAAA AACCAATCGT TGATAAGAAC GATAGGGATA ACAGGCAAGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CAATAAGAGC AATGATTTAA TCAACAAAGA CGCTCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 95 ATGACTAACG AAACCATTAA CCAACAACCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT GGCTTTTCTT AAAGTTGATA ACGCTGTCGC TTCATACGAT 120 CCTGATCAAA AACCAATCGT TGATAAGAAC GATAGGGATA ATAGGCAAGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CAATAAGAGC AATGATTTAA TCAACAAAGA CGCTCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTTGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTCATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 96 ATGACTAACG AAACCATTAA CCAACAACCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT GGCTTTTCTT AAAGTTGATA ACGCTGTCGC TTCATACGAT 120 CCTGATCAAA AACCAATCGT TGATAAGAAC GATAGGGATA ACAGGCAAGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAAACTTTAT CAATAAGAGC AATGATCTAA TCAACAAAGA CAATCTCATT 300 GATGTAGAAT CTTCCAAAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 97 ATGACTAACG AAACCATTAA CCAACAACCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT AGCTTTTCTT AAAGTTGATA ACGCTGTCGC TTCATACGAT 120 CCTGACCAAA AACCAATCGT TGATAAGAAC GATAGGGATA ACAGGCAAGC TTTTGATAGA 180 ATCTCACAAT TAAGGGAGGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CGATAAGAGC AACGATTTAA TCAACAAAGA CGCTCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 98 ATGACTAACG AAACCATTAA CCAACAACCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT GGCTTTTCTT AAAGTTGATA ACGCTGTCGC TTCATACGAT 120 CCTGATCAAA AACCAATTAT TGATAAGAAC GATAGGGATA ACAGGCAAGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATATTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CGATAAGAGC AATGATTTAA TCAACAAAGA CAATCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 99 ATGACTAACG AAACCATTAA CCAACAACCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT AGCTTTTCTT AAAGTTGATA ATGCTGTCGC TTCATACGAT 120 TCTGATCAAA AACCAATCAT TGATAAGAAC GATAGGGATA ACAGGCAAGC TTTTGATAGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CGATAAGAGC AACGATTTAA TCAACAAAGA CAATCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAATGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 100 ATGACTAACG AAACTATTGA CCAACAACCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATTAATA ATCTTCAGGT AGCTTTTCTT AAGCTTGATA ACGCTGTCGC TTCATTTGAT 120 CCTGATCAAA AACCAATCGT TGATAAGAAT GATAGGGATA ACAGGCAAGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CAATAAGAGC AATGATTTAA TCAACAAAGA CGCTCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCAAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 101 ATGACTAACG AAACTATTGA CCAACAACCA CAAACTGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT GGCTTTTCTT AAGCTTGATA ACGCTGTCGC TTCATTTGAT 120 CCTGATCAAA AACCAATCGT TGATAAGAAC GATAGGGATA ACAGGCAAGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CAATAAGAGC AATGATTTAA TCAACAAAGA CGCTCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 102 ATGACTAACG AAACTATTAA CCAACAGCCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT AGCTTTTCTT AAGCTTGATA ACGCTGTCGC TTCATTTGAT 120 CCTGATCAAA AACCAATCGT TGATAAGAAC GATAGGGATA ATAGGCAGGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CAATAAGAGC AATGATTTAA TCAACAAAGA CAATCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 103 ATGACTAACG AAACCATTAA CCAACAACCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT GGCTTTTCTT AAGCTTGATA ATGCTGTTGC TTCATTTGAT 120 CCTGATCAAA AACCAATCGT TGATAAGAAC GATAGGGATA ACAGGCAAGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CGATAAGAGC AACGATTTAA TCAACAAAGA CAATCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 104 ATGACTAACG AAACCATTAA CCAACAACCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT GGCTTTTCTT AAAGTTGATA ACGCTGTCGC TTCATACGAT 120 CCTGATCAAA AACCAATCGT TGATAAGAAC GATAGGGATA ATAGGCAAGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTAC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CAATAAGAGC AATGATTTAA TCAACAAAGA CGCTCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTTGGG ATCAGCGTTA CCGAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTCATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 105 ATGACTAACG AAACTATTGA TCAACAACCA CGAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT AGCTTTTCTT AAAGTTGATA ACGTTGTCGC TTCATTTGAT 120 CCTAATCAAA AACCAATCGT TGATAAGAAC GATAGGGATA ACAGGCAAGC TTTTGATGGA 180 ATCTCGCAAT TAAGGGAAGA ATACTCCAAT AAAGCGATCA AAAATCCTGC CAAAAAGAAT 240 CAGTATTTTT CAGACTTTAT CAATAAGAGC AATGATCTAA TCAACAAAGA CAATCTCATT 300 GATGTAGAAT CTTCCACAAA GAGCTTTCAG AAATTTGGGG ATCAGCGTTA CCAAATTTTC 360 ACAAGTTGGG TGTCCCATCA AAACGATCCG TCTAAAATCA ACACCCGATC GATCCGAAAT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 449 base pairs nucleic acid single linear DNA (genomic) NO NO 106 ATGACTAACG AAACCATTAA CCAACAACCA CAAACCGAAG CGGCTTTTAA CCCGCAGCAA 60 TTTATCAATA ATCTTCAAGT GGCTTTTATT AAAGTTGATA ATGTTGTCGC TTCATTTGAT 120 CCTGATCAAA AACCAATCGT TGATAAGAAT GATAGGGATA ATAGGCAAGC TTTTGAGAAA 180 ATCTCGCAGC TAAGGGAGGA ATTCGCTAAT AAAGCGATCA AAAATCCTGC CAAAAAGAAT 240 CAGTATTTTT CAAGCTTTAT CAGTAAGAGC AGTGATTTAA TCAACAAAGA CAGTCTCATT 300 GATACAGGTT CTTCCATAAA GAGCTTTCAG AAATTTGGGA CTCAGCGTTA CCAAATTTTT 360 ATGAATTGGG TGTCCCATCA AAAAGATCCA TCTAAAATCA ACACCCAAAA AATCCGAGGT 420 TTTATGGAAA ATATCATACA ACCCCCTAT 449 464 base pairs nucleic acid single linear DNA (genomic) NO NO 107 ATGACTAACG AAACTATTGA TCAAACAAGA ACACCAGACC AAACACAAAG CCAAACAGCT 60 TTTGATCCGC AACAATTTAT CAATAATATT CAAGTGGCTT TTCTTAAAGT TGATAACGCT 120 GTCGCTTCAT TTGATCCTGA TCAAAAACCA ATCGTTGATA AGAATGATAG GGATAACAGG 180 CAAGCTTTTG AGAAAATCTC GCAGCTAAGG GAGGAATTCG CTAATAAAGC GATCAAAAAT 240 CCTGCCAAAA AGAATCAGTA TTTTTCAAGC TTTATCAGTA AGAGCAGTGA TTTAGTCAAC 300 AAAGACAGTC TCATTGATAC AGGTTCTTCC ATAAAGAGCT TTCAGAAATT TGGGACTCAG 360 CGTTACCAAA TTTTTATGAA TTGGGTGTCC CATCAAAAAG ATCCATCTAA AATCAACACC 420 CAAAAAATCC AAGATTTTAT GGAAAATATC ATACAACCCC CTAT 464 464 base pairs nucleic acid single linear DNA (genomic) NO NO 108 ATGACTAATG AAACCATTGA TCAAACAACA ACACCAGATC AAACACCAAA TCAAACAGAT 60 TTTGTTCCGC AACGATTTAT CAATAATCTT CAAGTAGCTT TTATTAAAGT TGATAACGCT 120 GTCTCTTCAT TTGATCCTGA TCAAAAACCA ATCGTTGATA AGAATGATAG GGATAACAGG 180 CAAGCTTTTG AGAAAATCTC GCAACTAAGG GAAGAATACG CCAATAAAGC GATCAAAAAT 240 CCTGCCAAAA AGAATCAGTA TTTTTCAGAC TTTATCAATA AGAGCAATGA TTTGATCAAC 300 AAAGACAATC TCATTGCTGT AGATTCTTCC GTAGAGACCT TTAAGAAATT TGGGGATCAG 360 CGTTACCAAA TTTTTACGAA TTGGGTGTCC CTTCAAAAAG ATCCGTCTAA AATCAACACC 420 CGACAAATCC GAAATTTTAT GGAAAATATC ATACAACCCC CTAT 464 464 base pairs nucleic acid single linear DNA (genomic) NO NO 109 ATGACTAACG AAACCATTGA TCAAACAACA ACACCAGATC AAACACCAAA CCAAACGGAT 60 TTTGTTCCGC AACGATTTAT CAATAATCTT CAAGTAGCTT TCCTTAAAGT TGATAGCGCT 120 GTCGCTTCAT TTGATCCTGA TCAAAAACCA ATCGTTGATA AGAATGATAG GGATAACAGG 180 CAAGCTTTTG AGAAAATCTC GCAACTAAGG GAAGAATACG CCAATAAAGC GATCAAAAAT 240 CCTGCCAAAA AGAATCAGTA TTTTTCAGAC TTTATCAATA AGAGCAATGA TTTGATCAAC 300 AAAGACAATC TCATTGCTGT AGATTCTTCT GTAGAGAGCT TTAAGAAATT TGGGGATCAG 360 CGTTACCAAA TTTTTACGAG TTGGGTGTCC CTTCAAAAAG ATCCGTCTCA AATCAACACC 420 CGACAAATCC GAAATTTTAT GGAAAATATC ATACAACCCC CTAT 464 464 base pairs nucleic acid single linear DNA (genomic) NO NO 110 ATGACTAACG AAACCATTGA TCAAACAACA ACACCAGATC AAACACCAAG CCAAACAGAT 60 TTTGTTCCGC AACGATTTAT CAATAATCTT CAAGTAGCTT TTCTTAAAGT TGATAACGCT 120 GTCGCTTCAT TTGATCCTGA TCAAAAACCA ATCGTTGATA AGAATGATAG GGATAACAGG 180 CAAGCTTTTG AGAAAATCTC GCAACTAAGG GAAGAATACG CCAATAAAGC GATCAAAAAT 240 CCTGCCAAAA AGAATCAGTA TTTTTCAGAC TTTATCAATA AGACCAATGA TTTGATCAAC 300 AAAGACAATC TCATTGCTGT AGATTCTTCC GTAGATAGCT TTAAGAAATT TGGGGATCAG 360 CGTTACCAAA TTTTTACGAG TTGGGTGTCC CTTCAAAAAG ATCCGTCTAA AATCAACACC 420 CAACAAATCC GAAATTTTAT GGAAAATATC ATACAACCCC CTAT 464 464 base pairs nucleic acid single linear DNA (genomic) NO NO 111 ATGACTAACG AAACCATTGA TCAAACAACA ACACCAGATC AAACACCAAA TCAAACAGAT 60 TTTGTTCCGC AACGATTTAT CAATAATCTT CAAGTAGCTT TTATTAAAGT TGATGACGCT 120 GTCGCTTCAT TTGATCCCGA TCAAAAACCA ATCGTTGATA AGAATGATAG GGATAACAGG 180 CAAGCTTTTG AGAAAATCTC GCAACTAAGG GAAGAATACG CCAATAAAGC GATCAAAAAT 240 CCCACCAAAA AGAATCAGTA TTTTTCAGAC TTTATCAATA AGACCAATGA TTTGATCAAC 300 AAAGACAATC TCATTGCTGT AGATTCTTCC GTAGAGAGCT TTAAGAAATT TGGGGATCAG 360 CGTTACCAAA TTTTTACGAG TTGGGTGTCC CTTCAAAAAG ATCCGTCTAA AATCAACACC 420 CAACAAATCC GAAATTTTAT GGAAAATATC ATACAACCCC CTAT 464 464 base pairs nucleic acid single linear DNA (genomic) NO NO 112 ATGACTAACG AAACCATTGA TCAAACAACA ACACCAGATC AAACACCAAA TCAAACAGAT 60 TTTGTTCCGC AACGATTTAT CAATAATCTT CAAGTAGCTT TTATTAAAGT TGATAACGCT 120 GTTGCTTCAT TTGATCCCGA TCAAAAACCA ATCGTTGATA AGAATGATAG GGATAACAGG 180 CAAGCTTTTG AGAAAATCTC GCAACTAAGG GAAGAATACG CCAATAAAGC GATCAAAAAT 240 CCTGCCAAAA AGAATCAGTA TTTTTCAGAC TTTATCAATA AGAGCAATGA TTTGATCAAC 300 AAAGACAATC TCATTGCTGT AGATTCTTCC GTAGATAGCT TTAAGAAATT TGGGGATCAG 360 CGTTACCAAA TTTTTACGAG TTGGGTGTCC CTTCAAAAAG ATCCGTCTCA AATCAACACC 420 CAACAAATCC AAAATTTTAT GGAAAATATC ATACAACCCC CTAT 464 464 base pairs nucleic acid single linear DNA (genomic) NO NO 113 ATGACTAACG AAACCATTGA TCAAACAACA ACACCAGATC AAACACTAAA CCAAACGGAT 60 TTTGTTCCGC AACGATTTAT CAATAATCTT CAAGTAGCTT TTATTAAAGT TGATAACGCT 120 GTCGCTTTAT TTGATCCCGA TCAAAAACCA ATCGTTGATA AGAATGATAG GGATAACAGG 180 CAAGCTTTTG AGAAAATCTC GCAACTAAGG GAAGAATACG CCAATAAAGC GATCAAAAAT 240 CCCACCAAAA AGAATCAGTA TTTTTCAGAC TTTATCAATA AGAGCAATGA TTTGATCAAC 300 AAAGACAATC TCATTGCTGT AGATTCTTCC GTAGATAGCT TTAAGAAATT TGGGGATCAG 360 CGTTACCAAA TTTTTACGAG TTGGGTGTCC CTTCAAAAAG ATCCGTCTCA AATCAACACC 420 CGACAAATCC GAAATTTTAT GGAAAATATC ATACAACCCC CTAT 464 464 base pairs nucleic acid single linear DNA (genomic) NO NO 114 ATGACTAACG AAACCATTGA TCAAACAATA ACACCAGATC AAACACCAAA CCAAACGGAT 60 TTTGTTCCGC AACGATTTAT CAATAATCTT CAAGTAGCTT TTATCAAAGT TGATAACGCT 120 GTCGCTTCAT TTGATCCTGA TCAAAAACCA ATCGTTGATA AGAATGATAG GGATAACAGG 180 CAAGCTTTTG AGAAAATCTC GCAATTAAGG GAAGAATACG CCAATAAAGC GATCAAAAAT 240 CCTGCCAAAA AGAATCAGTA TTTTTTAGAC TTTATCAATA AGAGCAATGA TTTGATCAAC 300 AAAGACAATC TCATTGCTGT AGATTCTTCC GTAGATAGCT TTAAGAAATT TGGGGATCAG 360 CGTTACCAAA TTTTTACGAG TTGGGTGTCC CTTCAAAAAG ATCCGTCTAA AATCAACACC 420 CAACAAATCC GAAATTTTAT GGAAAATATC ATACAACCCC CTAT 464 132 base pairs nucleic acid single linear DNA (genomic) NO NO 115 CCCTATTATC TCTCTCGCTC TAGTGGGGGT GTTAATGGGT ACCGAACTAG GGGCTAACAC 60 GCCAAACGAT CCCATACACA GCGAGAGTCG CGCCTTTTTT ACAACCGTGA TCATTCCAGC 120 CATTGTTGGG GG 132 132 base pairs nucleic acid single linear DNA (genomic) NO NO 116 CCCTATTATC TCTCTCGCTC TAGTGGGGGT GTTAATGGGT ACCGAACTAG GGGCTAACAC 60 GCCAAACGAT CCCATACACA GCGAGAGTCG CGCCTTTTTC ACAACCGTGA TCATTCCAGC 120 CATTGTTGGA GG 132 132 base pairs nucleic acid single linear DNA (genomic) NO NO 117 CCCTATTATC TCTCTCGCTC TAGTGGGGGT GTTAATGGGC ACCGAACTAG GGGCTAATAC 60 GCCAAACGAT CCCATACACA GCGAGAGTCG CGCCTTTTTC ACAACCGTGA TCATTCCAGC 120 CATTGTTGGG GG 132 132 base pairs nucleic acid single linear DNA (genomic) NO NO 118 CCCTATTATC TCTCTCGCTC TAGTGGGGGT GTTAATGGGC ACCGAACTAG GGGCTAATAC 60 GCCAAACGAT CCCATACACA GCGAGAGTCG CGCCTTTTTC ACAACCGTGA TCATTCCAGC 120 CATTGTTGGG GG 132 132 base pairs nucleic acid single linear DNA (genomic) NO NO 119 CCCTATTATC TCTCTCGCTC TAGTGGGGGT GTTAATGGGC ACTGAACTAG GGGCTAACAC 60 GCCAAACGAT CCCATACACA GCGAGAGTCG CGCCTTTTTT ACAACCGTGA TCATTCCAGC 120 CATTGTTGGG GG 132 132 base pairs nucleic acid single linear DNA (genomic) NO NO 120 CCCTATTATC TCTCTCGCTC TAGTGGGGGT GTTAATGGGC ACCGAACTAG GGGCTAACAC 60 GCCAAATGAT CCCATACACA GCGAGAGTCG CGCCTTTTTC ACAACYGTGA TCATTCCAGC 120 CATTGTTGGG GG 132 132 base pairs nucleic acid single linear DNA (genomic) NO NO 121 CCCTATTATC TCTCTCGCTT TAGTGGGGGT RTTAATGGGC ACCGAACTAG GGGCTAACAC 60 GCCAAACGAT CCCATACACA GCGAGAGTCG CGCCTTTTTC ACAACCGTGA TCATTCCAGC 120 CATTGTTGGG GG 132 132 base pairs nucleic acid single linear DNA (genomic) NO NO 122 CCCTATTATT TCTCTCGCTT TAGTGGGGGT GTTAATGGGC ACCGAACTAG GGGCTAACAC 60 GCCAAACGAT CCCATACACA GCGAGAGTCG CGCCTTTTTT ACAACCGTGA TCATTCCAGC 120 CATTGTTGGG GG 132 132 base pairs nucleic acid single linear DNA (genomic) NO NO 123 CCCTATTATC TCTCTCGCTC TAGTGGGGGC ATTAATGAGT ACCGAACTAG GGGCTAACAC 60 GCCAAATGAT CCCATACACA GCGAGAGTCG CGCCTTTTTT ACAACCGTGA TCATTCCAGC 120 CATTGTTGGG GG 132 132 base pairs nucleic acid single linear DNA (genomic) NO NO 124 CCCTATTATC TCTCTCGCTT TAGTGGGGGT ATTAATAGGC ACCGAACTAG GGGCTAATAC 60 GCCAAATGAT CCCATACACA GCGAGAGTCG TGCTTTTTTC ACAACCGTGA TCATTCCAGC 120 CATTGTTGGG GG 132 132 base pairs nucleic acid single linear DNA (genomic) NO NO 125 CCCTATTATC TCTCTCGCTC TAGTGGGGGT GTTAATAAGC ACCGAACTAG GGGCTAACAC 60 GCCAAATGAT CCCATACACA GCGAGAGTCG CGCCTTTTTC ACAACGGGGA TCATTCCAGC 120 CATTGTTGGG GG 132 105 base pairs nucleic acid single linear DNA (genomic) NO NO 126 CCCTTTAGTT TCTCTCGCTT TAGTGGGGTT ATTGGTCAGC ATCACACCAC AAAAAAGTCA 60 TGCTGCCTTC TTTACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 127 CCCTTTAGTT TCTCTCGCTT TAGTGGGGTT ATTGGTCAGC ATCACACCAC AAAAAAGTCA 60 TGCTGCCTTC TTTACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 128 CCCTTTAGTT TCTCTCGCTT TAGTGGGGTT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCTGCCTTC TTTACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 129 CCCTCTAGTT TCTCTCGCTT TAGTGGGGTT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 130 CCCTCTAGTT TCTCTCGCTT TAGTGGGGTT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATCATTCC ADCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 131 CCCTTTAGTT TCTCTCGCTT TAGTGGGGTT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATCATTCC GGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 132 CCCTTTAGTT TCTCTCGCTT TAGTGGGGTT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 133 CCCTTTAGTT TCTCTCGCTT TAGTGGGGCT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 134 CCCTTTAGTT TCTCTCGCTT TAGTGGGGTT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC GGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 135 CCCTTTAGTT TCTCTCGCTT TAGTGGGGCT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATTATCCC GGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 136 CCCTTTAGTT TCTCTCGCTT TAGTGGGGCT ATTGGTCAGC ATCACACCAC AAAAAAGTCA 60 TGCCGCCTTT TTTACAACCG TGATCATTCC AGCCATTGTT GGGAG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 137 CCCTTTAGTT TCTCTCGCTT TAGTGGGGCT ATTGGTCAGC ATCACACCAC AAAAAAGTCA 60 TGCCGCCTTT TTTACAACCG TGATCATTCC AGCCATTGTT GGAGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 138 CCCTCTGGTA TCTCTCGCTT TAGTGGGGCT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGAATATTCC AGCMATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 139 CCCTCTGGTT TCTCTCGCTT TAGTGGGGCT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC GGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 140 CCCTCTGGTT TCTCTCGCTT TAGTGGGGCT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATCATTCC AGCMATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 141 CCCTCTGGTT TCTCTCGCTT TAGTGGGGCT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 142 CCCTCTGGTT TCTCTCGCTT TAGTGGGGCT ATTGGTCAGC ATCACACCAC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATCATTCC AGCHATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 143 CCCTCTGGTT TCTCTCGCTT TAGTGGGGCT ATTGGTCAGC ATCACACCAC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATCATTCC AGCYATTGTT GGAGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 144 CCCTCTGGTT TCTCTCGCTT TAGTGGGGTT ATTGGTCAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATCATTCC AGCCATTGTT GGAGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 145 CCCTTTAGTT TCTCTCGCTT TAGTGGGGTT ATTGGTTAGC ATCACACCGC AAAAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATTATTCC AGCCATTGGT TGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 146 CCCTCTGGTT TCTCTCGCTT TAGTGGGGCT ATTGGTCAGC ATCACACCAC AAAAAAGTCA 60 TGCCGCCTTC TTTACAACCG TGATCATTCC AGCCATCGTT CCCCC 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 147 CCCTTTAGTT TCTCCTGTTT TAGCAGGAGC GTTGATTAGC TCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 148 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGT GCCATACCGC AAGAAAGTCA 60 TGCCGCCTTT TTTACAACCG TAATCATTCC AGCTATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 149 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 150 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTTATTAGC GCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 151 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AAGAAAGTTA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 152 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 153 CCCTATTATT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC TCCATACCGC AAGAAAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 154 CCCTTTAGTT TCTCTCGTTT TAGCAGNAGC GTTNATTAGT GCCATACCGC AANANAGTCA 60 TGCCGCCTTT TTCACNACCG TNATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 155 CCCTTTGGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTG GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 156 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 157 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 158 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC TCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 159 CCCTTTAGTT TCTCTTGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 160 CCCTTTAGTT TCTCTTGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 161 CCCTTTGGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCVATTGTG GGGAG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 162 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 163 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGT GCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 164 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGT GCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 165 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGGTTAGT GCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 166 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGT GCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC ARCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 167 CCCTTTAGTT TCTCTTGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACAACCG TAATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 168 CCCTTTAGTT TCTCTTGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACGACCG TAATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 169 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACAACCG TNATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 170 CCCTTTGGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC TCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACACCCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 171 CCCTTTGGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC TCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 172 CCCTTTGGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC TCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 173 CCCTTTGGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC TCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 174 CCCTTTAGTT TCTCTCGTTT TAGCAGGAGC GTTGATTAGC TCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 175 CCCTTTAGTT TCTCTCGTTT TAGCANNAGC GTTGATTAGC RCCATACCGC AAGAGAGTCA 60 TGCCGCCTTT TTTACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 176 CCCTTTAGTT TCTCTTGTTT TAGCAGGAGC GTTGATTAGC GCCATACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACGACCG TNATCATTCC AGCCATTGTT NNNNG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 177 CCCTCTGGTT TCTCTGGTTT TAGCAGGAGC GTTGATTAGC ATCACACCAC AACAAAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 178 CCCTCTGGTT TCTCTCGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 179 CCCTCTGGTT TCTCTCGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 180 CCCTCTGGTT TCTCTCGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGAGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 181 CCCTCTGGTT TCTCTCGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT TTTACAACCG TGATCATTCC AGCCATTGTT GGAGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 182 CCCTTTGGTT TCTCTCGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACGACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 183 CCCTCTGGTT TCTCTTGCTT TAGTAGGAGC GTTAGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 184 CCCTCTAGTT TCTCTCGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTG GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 185 CCCTTTAGTT TCTCTTGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 186 CCCTCTNGTT TCTCTCGCTT TAGTAGNAGC ATTNGTCAGC ATCACACCGC AACANAGTCA 60 TGCCGCCTTT TTCACAACCG TNATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 187 CCCTCTGGTT TCTCTCGCTT TAGTAGGAGC GTTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT ATTACAACCG TGATCATTCC AGCCATTGTT GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 188 CCCTCTGGTT TCTCTTGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT TTTACAACCG TGATTATTCC AGCCATTGTG GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 189 CCCTCTGGTT TCTCTTGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCTCTT TTTACAACCG TGATTATTCC AGCCATTGTG GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 190 CCCTCTGGTT TCTCTTGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCTCTT TTTACAACCG TGATTATTCC AGCCATTGTG GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 191 CCCTCTGGTT TCTCTCGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCTCTT TTTACAACCG TGATTATTCC AGCCATTGTG GGGGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 192 CCCTYTGGTT TCTCTTGCTT TAGTAGGAGC ATTGRTTAGY RYCAYACCGC AACAAAGTCA 60 TGCCGCCTTT TTYACRACCG TGATCATTCC AGCCATTGTT GGRGG 105 105 base pairs nucleic acid single linear DNA (genomic) NO NO 193 CCCTATTATC TCTCTCGCTT TAGTAGGAGC ATTGGTCAGC ATCACACCGC AACAAAGTCA 60 TGCCGCCTTT TTCACAACCG TGTTCATTCC AGCCATTGTT GGGGG 105 362 base pairs nucleic acid single linear DNA (genomic) NO NO 194 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACAAAGAG CGATAACGGG 120 CTAAACACTA GCACTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACG 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACTGGC TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 195 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACAAAGAG CGATAACGGG 120 CTAAACACTA GTGCTTTGGA TTTGAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCG 300 CACATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 196 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACAAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTGAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCG 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 197 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACAAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTGAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCG 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 198 GTGGATGCTC ATACAGCTTA TTTTAATGGT AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACAAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTCAGCGGC GTTACAGATA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAACGT GAACATTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 199 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GCACTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATCAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCGG CTTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 200 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CTAGTAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 2 base pairs nucleic acid single linear DNA (genomic) NO NO 201 WC 2 362 base pairs nucleic acid single linear DNA (genomic) NO NO 202 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATGTTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGCAAGAG CGATAACGGG 120 CTAAACACTA GTGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 203 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATGTTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGCAAGAG CGATAACGGG 120 CTAAACACTA GTGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 204 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATGTTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGCAAGAG CGATAACGGG 120 CTAAACACTA GTGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 205 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATGTTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG CCCATAGCGC TCATTTTAAA AATATTGATG CCAGCAAGAG CGATAACGGG 120 CTAAACACTA GTGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 206 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACAAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTTAGTGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCAG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 207 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GTGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTTGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCAG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 208 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AAAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAATGGT 120 CTAAACACTA GCGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAATGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 209 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AAAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTGAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACAGCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT GGTTGTTACG 240 ACCCGTGTTC AGAGTTTTGG ACAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CTTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 210 GTGGATGCTC ATACAGCTAA CTTTAATGGC AATATTTATT TGGGAAAATC CACGAATTTA 60 AGGGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GCTCTTTGGA TTTCAGTGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAATTATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 211 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATT TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GTGCTTTGGA TTTTAGCGGC GTTACAGATA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 212 GTGGATGCCC ATACGGCTAA CTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AAAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACAAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTGAGTGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 213 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AAAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAATGGT 120 CTAAACACTA GTGCTTTGGA TTTGAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACAGCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACG 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG AGATCAATCG 300 CGCATTGGTG TCGTTAGTTT GCAAACTGGC TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 214 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AAAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACTAAGAG CGATAATGGT 120 CTAAACACTA GCGCTTTGGA TTTGAGCGGC GTTACAAACA AGGTCAATAT CAACAAGCTC 180 ACTACAGCTG CCACTAATGT GTCCATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACG 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATCAATCG 300 CGCATTGGTG TCGTTAGTTT GCAAACTGGC TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 215 GTGGATGCTC ATACGGCTAA CTTTAATGGC AATGTTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 216 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATGTTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGCAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTTAGCGGC GTTACAGATA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 217 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATGTTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CTAGTAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCAG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 218 GTGGACGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGCAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTCAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACG TTAAGGAATT GGTGGTTACA 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 219 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGCAAGAG CGATAACGGG 120 CTAAACACTA GCACTTTGGA TTTTAGCGGC GTTACAGATA AAGTCAATAT CAACAAGCTC 180 ACTACAGCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 220 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATT TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 221 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATT TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GTGCTTTGGA TTTTAGCGGC GTTACAGATA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 222 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGATTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AGAGTTTTGG GCAATACTCT ATTTTTGGCG AAAATATAGG CGATAAGTCG 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 223 GTGGACGCTC ATACAGCTTA TTTTAATGGC AATATTTATT TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACAAAGAG CGATAACGGG 120 CTAAACACTA GCACTTTGGA TTTGAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACG 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 224 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACAAAGAG CGATAACGGG 120 CTAAACATTA GCACTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 225 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTG 60 AGAGTGAATG GCCATAACGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GCACTTTGGA TTTGAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACAGCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACG 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG TGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 226 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATGTTTATT TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTGAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT GGTGGTTACG 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TGTCGCCCGG CCTGTTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 227 GGGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATT TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GCGYTTTGGA TTTTAGCGGC GTTACAGAYA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACRTTAAA AACTTTGAYA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATMAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCRG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 228 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACAAAGAG CGATAATGGT 120 ATAAACACTA GCACTTTGGA TTTGAGCGGC GTTACAGACA AGGTCAATAT CAACAAGCTC 180 ATTACAGCTT CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGTGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 229 GTGGATGCTC ATACGGCTAA CTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTTAA AATATTGATG CCAGCAAGAG CGATAACGGG 120 CTAAACACTA GCACCTTGGA TTTCAGTGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGATA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCG 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCAG CTTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 230 GTGGATGCTC ATACGGCTAA CTTTAATGGC AATATTTATT TGGGAAAATC CACGAATTTG 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTTAGCGGC GTTACAGACA AAGTTAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACGTTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGAGTTC AAAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCT 300 CGCATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCTG CTTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 231 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AGAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCAGTAAGAG CGATAACGGG 120 CTAAACACTA CCACTTTGGA TTTCAGCGGC GTTACAGATA AAGTCAATAT CAACAAGCTC 180 ACTACATCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACA 240 ACCCGAGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGCTG 300 CACATTGGTG TCGTGAGTTT GCAAACGGGA TATAGCCCAG CCTATTCTGG GGGGCTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 232 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AAAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CTAGTAAGAG CGATAACGGG 120 CTAAACACTA GCGCTTTGGA TTTGAGCGGC GTTACAAACA AGGTCAATAT CAACAAGCTC 180 ACTACAGCTG CCACTAATGT GAACATTAAA AACTTTGACA TTAAGGAATT GGTGGTTACA 240 ACCCGCGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATAAGTCG 300 CGCATTGGTG TCGTTAGTTT GCAAACTGGC TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 233 ATGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AAAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACTAAGAG CGATAATGGT 120 CTAAACACTA GCGCTTTGGA TTTGAGCGGC GTTACAAACA AGGTCAATAT CAACAAGCTC 180 ACTACAGCTG CCACTAATGT GTCCATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACG 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATCAATCG 300 CGCATTGGTG TCGTTAGTTT GCAAACTGGC TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 234 GTGGATGCTC ATACAGCTTA TTTTAATGGC AATATTTATC TGGGAAAATC CACGAATTTA 60 AAAGTGAATG GCCATAGCGC TCATTTTAAA AATATTGATG CCACTAAGAG CGATAATGGT 120 CTATACACTA GCGCTTTGGA TTTGAGCGGC GTTACAAACA AGGTCAATAT TAACACGCTC 180 ACTACAGCTG CCACTAATGT GTCCATTAAA AACTTTGACA TTAAGGAATT AGTGGTTACG 240 ACCCGTGTTC AGAGTTTTGG GCAATACACT ATTTTTGGCG AAAATATAGG CGATCAATCG 300 CGCATTGGTG TCGTTAGTTT GCAAACTGGC TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 235 GTGGATGCCC ATACGATCAA TTTTAATGGC AATATGTATT TGGGAAGATT TACGCATTTA 60 AAAGTGAATG GTCATACAGC CAATTTTAAA GATATTGATG CCAGCAAGGG TAGAAATGGT 120 ATCGACACCA CCATTTTGGA TTTTAGCGGC GTTACAAACA AGGTCAATAT CAACAAGCTC 180 ACCACAGCTG CCACTAATGC GGCCATTAAA AATTTTGACA TTAAGGAATT GGTTGTTACA 240 ACCAATGTTT TGAGTGTGGG GAAATACACT GATTTTACCG AAGATATAGG CGATCAATCC 300 CGCATTGGTA TCGTGCGTTT GCAAATGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 236 GTGGATGCCC ATACGATCAA TTTTAATGGC AATATGTATT TGGGAAGATT CACGCATTTA 60 AAAGTGAATG GTCATACAGC CAATTTTAAA GATATTGATG CCAGCAAGGG TAGAAATGGT 120 ATCGACACCA CCATTTTGGA TTTTAGCGGC GTTACAAACA AGGTCAATAT CAACAAGCTC 180 ACCACAGCTG CCACTAATGC GGCCATTAAA AATTTTGACA TTAAGGAATT GGTTGTTACA 240 ACCAATGTTT TGAGTGTGGG GAAATACACT GATTTTACCG AAGATATAGG CGATCAATCC 300 CGCATTGGTA TCGTTAGTTT GCAAACGGGA TATAGCCCGG CCTATTCTGG GGGCGTTACT 360 TT 362 362 base pairs nucleic acid single linear DNA (genomic) NO NO 237 GTGGATGCCC ATACGATCAA TTTTAATGGC AACATGTATT TGGGAAGATT CACGCATTTA 60 AAAGTGAATG GCCATACAGC CAATTTTAAA GATATTGATG CCAGCAAGGG TAGAAATGGT 120 ATCGACACCA CTATTTTGGA TTTTAGCGGC GTTACAGACA AAGTCAATAT CAACAAGCTC 180 ACTACAGCTG CCACTAATGT GTCCATTAAA AACTTTGACA TTAAGGAATT GGTTGTTACA 240 ACCAATGTTT TGAGTGTGGG GAAATACACT GATTTTACCG AAGATATAGG CGATCAATCG 300 CACATTGGTG TCGTTAGTTT GCAAACTGGC TATAGCCCGG TCTATTCTGG GGGCGTTACT 360 TT 362 288 base pairs nucleic acid single linear DNA (genomic) NO NO 238 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATCAACA AGCTCATTAC GGCTTCCACT 120 AATGTGGCCG CTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGGGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGCGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 239 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATCAACA AGCTCATCAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGTGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 240 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATCAACA AGCTCATCAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGAAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG ATCAATCTTT TCTGGGGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 241 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GCAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGCGTTAC AGACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAGACCAA TGGGGTGAGC 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 242 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGCGTTAC AGACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAATTGGTTG TTAAGACCAA TGGGGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAACACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGGGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 243 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATCAACA AGCTCATCAC GGCTTCCACT 120 AATGTGGCCG TTAAAAACAA CAACATTAAT GAATTGGTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CAGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 244 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AAACAAAGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAGATTAA TGGGGTGAGT 180 GTGGGGGAAT ACACTTATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAACACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGCGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 245 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGATTTCA GTGGTGTTAC AGACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAATTGGTGG TTAAAACCAA TGGTATAAGC 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAACACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGCGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 246 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGATTTCA GTGGTGTTAC AGACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAATTGGTGG TTAAAACCAA TGGTATAAGC 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGCGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 247 GTGGATGCCC ATACAGCTAA TTTTAAAGGT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGATTTCA GTGGCGTTAC AAACAAAGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAATTGGTGG TTAAAACCAA TGGTATAAGC 180 GTGGGGGAAT ACACTAATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAACACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGCGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 248 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATCAACA AGCTCATCAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGGGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGCGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 249 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATCAACA AGCTCATCAC AGCTTCCACT 120 AATGTGGCCG CTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGGGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGCGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 250 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATCAACA AGCTCATCAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGTGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAACACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGCGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 251 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATTAACA AGCTCATTAC GGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGGGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 252 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTGAC AGGTATAGTC AATATCAACA AGCTCATCAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGGGTGAGC 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 253 GTGGATGGTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATCAACA AGCTCATAAC GGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGTGTGAGT 180 GTGGGGGAAT ACACTTATTT TAGCGAAGAT ATAGGCAGTC AATCGCACAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 254 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATCAACA AGCTCATTAC GGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGTGTGAGT 180 GTGGGGGAAT ACACTTATTT TAGCGAAGAT ATAGGCAGTC AATCGCACAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 255 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGACAAAGTC AATATCAACA AGCTCATCAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGGGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACCAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 256 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGACAAAGTC AATATCAACA AGCTCATTAC GGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGGGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 257 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGACAAAGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGGTTG TTAAAACCAA TGGGGTAAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCACAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 258 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AGCTCATTAC GGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGGATGAGC 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 259 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGGGTGAGC 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 260 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AGCTCATTAC GGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGATTG TTAAAACCAA TGGGGTGAGC 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 261 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTGGATTTTA GTGGCGTTAC AGACAAAGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAGACCAA TGGGGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAACACCGTG 240 CGTTTAGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 262 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATCAACA AGCTCATTAC GGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAGACCAA TGGGGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 263 GTGGATGCTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC AATATCAACA AGCTCATTAC GGCTTCCACT 120 AATGTAGCCG TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAGACCAA TGGGGTGAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAACACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 264 GTGGATGGTC ATACAGCTAA TTTTAAAGGT ATTGATACGG GTAATGGTGG TTTCCACACC 60 TTAGATTTTA GTGGTGTTAC AGGTAAGGTC CATATCCACA AGCTCATTAC GGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CCACATTAAT GAATTGATTG GTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTTT TCTGGGGGTG TCAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 265 GCGAGCGCTC ATACGGTCAA TTTTAAAGAT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTAATTT TAGCGAAGAT ATAGGCAATC AATCGCGCAT CAACACCGTG 240 CGTTTGGAAA CTGGCACCAG GTCAATCTAT TTTGGGGGTG TTAAATTA 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 266 GCGAGCGCTC ATACGGTCAA TTTTAAAGAT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGACTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTAATTT TAGCGAAGAT ATAGGCAATC AATCGCGCAT CAACACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGGGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 267 GCGAGCGCTC ATACGGTCAA TTTTAAAGAT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGACTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTAATTT TAGCGAAGAT ATAGGCAATC AATCGCACAT CAACACCGTG 240 CGTTTAGAAA CTGGCACTAG GTCAATCTAT TCTGGGGGTG TTAAGTTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 268 GCGAGACGTC ATACGGTCAA TTTTAAAGAT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAGT GAATTGTTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTAATTT TAGCGGAGAT ATAGGCAATC AATCGCGCAT CAACACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGGGGTG TTAAGTTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 269 GTGGATGCCC ATACGGTCAA TTTTAAAGGT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGATTTCA GTGGTGTTAC AGACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAATTGGTGG TTAAAACCAA TGGTATAAGC 180 GTGGGGGAAT ACACTCATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGCGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 270 GCGAGCGCTC ATACGGTCAA TTTTAAAGAT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGACTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AACTCATTAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTAATTT TAGCGAAGAT ATAGGCAATC AATCGCGCAT CAACACCGTG 240 CGTTTAGAAA CTGGCACCAG GTCAATCTAT TCTGGGGGTG TTAAGTTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 271 GCGAGCGCTC ATACGGTCAA TTTTAAAGAT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGACTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AACTCATTAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTAATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAATACCGTG 240 CGTTTAGAAA CTGGCACCAG GTCAATCTAT TCTGGGGGTG TTAAGTTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 272 GCGAGCGCTC ATACGGTCAA TTTTAAAGAT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTAATTT TAGCGAAGAT ATAGGCAATC AATCGCGCAT CAATACCGTG 240 CGTTTAGAAA CTGGCACTAG GTCAATCTAT TCTGGGGGTG TTAAGTTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 273 GCGAGCGCTC ATACGGTCAA TTTTAAAGAT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGATTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTAATTT TAGCGAAGAT ATAGGCAATC AATCGCGCAT CAACACCGTG 240 CGTTTGGAAA CTGGCACTAG GTCAATCTAT TCTGGGGGTG TTAAGTTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 274 GCGAGCGCTC ATACGGTCAA TTTTAAAGAT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGACTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCG TTAAAAACTT CAACATTAAT GAATTGTTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTAATTT TAGCGAAGAT ATAGGCAATC AATCGCGCAT CAACACCGTG 240 CGTTTAGAAA CTGGCACTAG GTCAATCTAT TCTGGGGGTG TTAAATTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 275 GCGAGCGCTC ATACGGTCAA TTTTAAAGAT ATTGATACTG GTAATGGTGG TTTCAATACC 60 TTAGACTTTA GTGGTGTTAC AAACAAGGTC AATATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAATTT CAACATTAAT GAATTGTTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTAATTT TAGCGAAGAT ATAGGCAGTC AATCGCGCAT CAACACCGTG 240 CGTTTAGAAA CTGGCACTAG GTCAATCTAT TCTGGGGGTG TTAAGTTT 288 288 base pairs nucleic acid single linear DNA (genomic) NO NO 276 ACGAGCGCTC ATACGGTCAA TTTTAAAGAT ATTGATACTG GTAATGGTGG TTTCAACACC 60 TTAGACTTTA GTGGTGTTAC AAACAAGGTC AACATCAACA AGCTCATTAC AGCTTCCACT 120 AATGTGGCCA TTAAAAACTT CAACATTAAT GAGTTGTTGG TTAAAACCAA TGGGATAAGT 180 GTGGGGGAAT ACACTAATTT TAGCGAAGAT ATAGGCAATC AATCGCGCAT CAACACCGTG 240 CGTTTAGAAA CTGGCACTAG GTCAATCTAT TCTGGGGGTG TTAAGTTT 288 21 base pairs nucleic acid single linear DNA (genomic) NO NO 277 ATGGAAATAC AACAAACACA C 21 19 base pairs nucleic acid single linear DNA (genomic) NO NO 278 CTGCTTGAAT GCGCCAAAC 19 21 base pairs nucleic acid single linear DNA (genomic) NO NO 279 CACAGCCACT TTCAATAACG A 21 20 base pairs nucleic acid single linear DNA (genomic) NO NO 280 CGTCAAAATA ATTCCAAGGG 20 

1. Method for the detection and/or typing of Helicobacter pylori (H.pylori) strains present in a sample comprising the steps of: (i) if need be releasing, isolating or concentrating the polynucleic acids in the sample; (ii) amplifying the polynucleic acids of relevant target regions of the vacA gene and possibly other virulence determinant genes (VDG), with suitable primer pairs, said primers being generally applicable on different H.pylori strains, allowing to amplify said relevant target regions of the VDG preferentially in compatible amplification conditions; (iii) hybridizing the polynucleic acids obtained in (i) or (ii) with a set of at least two VDG-derived probes, under appropriate hybridization and wash conditions, and with at least one of said probes hybridizing to a conserved region of a VDG of H.pylori, and with at least one of said probes hybridizing to a variable region of vacA; (iv) detecting the hybrids formed in step (iii); (v) detecting and/or typing H.pylori strains present in a sample from the differential hybridization signals obtained in step (iv), with said typing being the allele-specific detection of a strain according to the VDG alleles present in that particular H.pylori strain, and the said virulence determinant genes being the genetic elements involved in enabling, determining, and marking of the infectivity and/or pathogenicity of said H.pylori strain.
 2. Method according to claim 1 wherein step (ii) consists of amplifying the polynucleic acids of relevant target regions in the vacA and cagA gene with suitable primers, said primers being generally applicable on different H. pylori strains, allowing to amplify said relevant target regions in compatible amplification conditions, with said target regions being a conserved region in the case of the cagA alleles and a variable region in the case of the vacA alleles, and with said primers being preferentially chosen from the following list: - cagF (SEQ ID N012) cagR (SEQ ID NO13) VA1XR (SEQ ID NO14) VA1F (Atherton et al, 1995) M1F (SEQ ID NO15) M1R (SEQ ID NO16) HPMGF (SEQ ID NO 17) HPMGR (SEQ ID NO 18) cagSF (SEQ ID NO 19) cagSR (SEQ ID NO 20) cagEN1 (SEQ ID NO 21) cagRN1 (SEQ ID NO 22) VAMSFb (SEQ ID NO 23) VAMSFc (SEQ ID NO 24) VAMSFd (SEQ ID NO 25) VAMSFe (SEQ ID NO 26)

or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—, by others (including modified nucleotides such as inosine), or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize/amplify specifically with the same specificity as the oligonucleotide primers from which they are derived.
 3. Method according to any of claims 1 and 2, wherein step (iii) consists of hybridizing the polynucleic acids obtained in step (ii) with a set of probes, under appropriate hybridization and wash conditions, said set of probes being preferentially applicable in a simultaneous hybridisation assay and comprising at least one probe hybridizing to a conserved region of the cagA gene of H.pylori and at least one probe hybridizing to a variable region of the vacA gene of H.pylori, and more preferentially said set of probes comprising at least one of the following cagA- and vacA-derived probes: cag A-derived probe(s): cagApro (SEQ ID NO1) cagprobe3 (SEQ ID NO 27) vacA-derived probe(s): P1S1 (SEQ ID NO2) P22S1a (SEQ ID NO3) P1S1b (SEQ ID NO4) P2S1b (SEQ ID NO5) P1S2(VAS2) (SEQ ID NO6) P2S2 (SEQ ID NO7) P1M1 (SEQ ID NO8) P2M1 (SEQ ID NO9) P1M2 (SEQ ID NO10) P2M2 (SEQ ID NO11) P3S1 (SEQ ID NO 28) P4S1 (SEQ ID NO 29) P1M1new (SEQ ID NO 30) P2M1new (SEQ ID NO 31) P1M2new (SEQ ID NO 32) P2M2new (SEQ ID NO 33) P1M3 (SEQ ID NO 34)

or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—, by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide probes, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide probes from which they are derived.
 4. Method for the detection of H.pylori strains present in a sample comprising the steps of: (i) if need be releasing, isolating or concentrating the polynucleic acids in the sample; (ii) amplifying the polynucleic acids of a relevant target region of the vacA gene with a suitable primer pair, said primer pair being generally applicable on different H.pylori strains, allowing to amplify said relevant target region of the vacA gene preferentially in compatible amplification conditions; (iii) hybridizing the polynucleic acids obtained in (i) or (ii) with at least one probe hybridizing to a conserved region of the vacA gene; (iv) detecting the hybrids formed in step (iii); (v) determining the presence or absence of H.pylori in a sample from the hybridization signals obtained in step (iv).
 5. Method according to claim 4 wherein step (ii) consists of amplifying the polynucleic acids of a relevant target region in the vacA gene with suitable primers, said primers being generally applicable on different H. pylori strains, allowing to amplify said relevant target region in compatible amplification conditions, with said target region being a conserved region, with said primers preferentially being VA1F and VA1XR (SEQ ID NO14), or sequence variants thereof with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—, by others (including modified nucleotides such as inosine), or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize/amplify specifically with the same specificity as the oligonucleotide primers from which they are derived.
 6. Method according to any of claims 4 and 5, wherein step (iii) consists of hybridizing the polynucleic acids obtained in step (ii) with a set of probes, under appropriate hybridization and wash conditions, said set of probes being preferentially applicable in a simultaneous hybridisation assay and comprising at least one probe hybridizing to a conserved region of the vacA gene of H.pylori, and more preferentially said set of probes comprising at least one of the following vacA-derived probes: HpdiaS1 (SEQ ID NO 35) HpdiaS2 (SEQ ID NO 36) HpdiaS3 (SEQ ID NO 37) HpdiaS4 (SEQ ID NO 38) HpdiaS5 (SEQ ID NO 39)

or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—, by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide probes, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide probes from which they are derived.
 7. Method according to any of claims 1 to 6, characterized further in that step (iii) is a reverse hybridization step, with the probes being immobilized, preferably as parallel lines, on a solid support, preferably a membrane strip.
 8. Method according to any of claims 1 to 6, further characterized in that the polynucleic acids obtained in step (ii) are immobilized on a solid support, preferably a microtiter plate, and that the subsequent hybridization of step (iii) is carried out on said solid support.
 9. A probe composition for use in a method according to any of claims 1 to 3, said composition comprising at least one probe hybridizing to a conserved region of a VDG of H.pylori, and at least one probe hybridizing to a variable region of vacA, and more preferentially said probes being derived from the polynucleic acid sequences of the vacA and/or cagA gene of H.pylori, and most preferentially said probes being chosen from the following cag A-derived probe(s): cagApro (SEQ ID NO1) cagprobe3 (SEQ ID NO 27) vacA-derived probe(s): P1S1 (SEQ ID NO2) P22S1a (SEQ ID NO3) P1S1b (SEQ ID NO4) P2S1b (SEQ ID NO5) P1S2(VAS2) (SEQ ID NO6) P2S2 (SEQ ID NO7) P1M1 (SEQ ID NO8) P2M1 (SEQ ID NO9) P1M2 (SEQ ID NO10) P2M2 (SEQ ID NO11) P3S1 (SEQ ID NO 28) P4S1 (SEQ ID NO 29) P1M1new (SEQ ID NO 30) P2M1new (SEQ ID NO 31) P1M2new (SEQ ID NO 32) P2M2new (SEQ ID NO 33) PLM3 (SEQ ID NO 34)

or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—, by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide probes, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide probes from which they are derived.
 10. A probe composition for use in a method according to any of claims 4 to 6, said composition comprising at least one probe hybridizing to a conserved region of the vacA gene of H.pylori, and most preferentially said probe being chosen from the following list: HpdiaS1 (SEQ ID NO 35) HpdiaS2 (SEQ ID NO 36) HpdiaS3 (SEQ ID NO 37) HpdiaS4 (SEQ ID NO 38) HpdiaS5 (SEQ ID NO 39)

or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—, by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide probes, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide probes from which they are derived.
 11. A composition comprising at least one suitable oligonucleotide amplification primer, allowing to amplify the polynucleic acids of the relevant target regions of the respective VDG, said suitable primers being generally applicable with different H.pylori strains and allowing the amplification of said relevant target regions to be used in compatible amplification conditions, and more preferentially said primers allowing the amplification of a conserved target region of the cagA gene and a region of the vacA gene comprising conserved and/or variable target regions, and most preferentially said primers being selected from the following list: cagF (SEQ ID NO12) cagR (SEQ ID NO13) VA1XR (SEQ ID NO14) M1F (SEQ ID NO15) M1R (SEQ ID NO16) HPMGF (SEQ ID NO 17) HPMGR (SEQ ID NO 18) cagSF (SEQ ID NO 19) cagSR (SEQ ID NO 20) cagFN1 (SEQ ID NO 21) cagRN1 (SEQ ID NO 22) VAMSFb (SEQ ID NO 23) VAMSFc (SEQ ID NO 24) VAMSFd (SEQ ID NO 25) VAMSFe (SEQ ID NO 26)

or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—, by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide primers, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide primers from which they are derived.
 12. Probe being derived from the polynucleic acid sequences of the vacA and/or cagA gene of H.pylori, and with said probe being chosen from the following list: cagApro (SEQ ID NO1) cagprobe3 (SEQ ID NO 27) P1S1 (SEQ ID NO2) P22S1a (SEQ ID NO3) P1S1b (SEQ ID NO4) P2S1b (SEQ ID NO5) P1S2(VAS2) SEQ ID NO6) P2S2 (SEQ ID NO7) P1M1 (SEQ ID NO8) P2M1 (SEQ ID NO9) P1M2 (SEQ ID NO10) P2M2 (SEQ ID NO11) P3S1 (SEQ ID NO 28) P4S1 (SEQ ID NO 29) P1M1new (SEQ ID NO 30) P2M1new (SEQ ID NO 31) P1M2new (SEQ ID NO 32) P2M2new (SEQ ID NO 33) P1M3 (SEQ ID NO 34) HpdiaS1 (SEQ ID NO 35) HpdiaS2 (SEQ ID NO 36) HpdiaS3 (SEQ ID NO 37) HpdiaS4 (SEQ ID NO 38) HpdiaS5 (SEQ ID NO 39)

or sequence variants thereof, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—, by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide probes, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize specifically with the same specificity as the oligonucleotide probes from which they are derived.
 13. Oligonucleotide amplification primer allowing the amplification of a region of the cagA gene or a region of the vacA gene of H.pylori, and with said primer being selected from the following list: cagF (SEQ ID NO12) cagR (SEQ ID NO13) VA1XR (SEQ ID NO14) M1F (SEQ ID NO15) M1R (SEQ ID NO16) HPMGF (SEQ ID NO 17) HPMGR (SEQ ID NO 18) cagSF (SEQ ID NO 19) cagSR (SEQ ID NO 20) cagFN1 (SEQ ID NO 21) cag1RN1 (SEQ ID NO 22) VAMSFb (SEQ ID NO 23) VAMSEC (SEQ ID NO 24) VAMSFd (SEQ ID NO 25) VAMSFe (SEQ ID NO 26)

or sequence variants thereon, with said sequence variants containing deletions and/or insertions and/or substitutions of one or more nucleotides, mainly at their extremities (either 3′ or 5′), and or substitutions of non-essential nucleotides,—being nucleotides not essential in discriminating between alleles—, by others (including modified nucleotides such as inosine), or with said variants consisting of the complement of any of the above-mentioned oligonucleotide primers, or with said variants consisting of ribonucleotides instead of deoxyribonucleotides, all provided that the variants can hybridize/amplify specifically with the same specificity as the oligonucleotide primers from which they are derived.
 14. A method according to any of claims 1 to 6 for the detection and/or typing of alleles of VDG of H.pylori, more preferentially alleles of the cagA and vacA gene of H. pylori, present in a sample using a set of probes and/or primers specially designed to detect and/or to amplify and/or to type the said alleles, with said probes and primers being defined in any of claims 7 to
 11. 15. A solid support, preferentially a membrane strip, carrying on its surface, at least one probe according to any of claims 7, 8 and 10, coupled to said support.
 16. A kit for detecting and/or typing H. pylori strains in a sample liable to contain it, comprising the following components: when appropriate at least one oligonucleotide primer according to any of claims 9 and 11; at least one probe according to any of claims 7, 8 and 10, with said probe and/or other probes applied being by preference immobilized on a solid support; a buffer or components necessary to produce the buffer enabling an amplification or a hybridization reaction between these probes and the amplified products; when appropriate a means for detecting the hybrids resulting from the preceding hybridization.
 17. An isolated vacA polynucleic acid sequence defined by SEQ ID NO 40 to 91 and SEQ ID NO 115 to 276 or any fragment thereof, that can be used as a primer or as a probe in a method for detection and/or typing of a vacA allele of H. pylori.
 18. An isolated cagA polynucleic acid sequence defined by SEQ ID NO 92 to 114 or any fragment thereof, that can be used as a primer or as a probe in a method for detection and/or typing of a cagA allele of H. pylori.
 19. A vacA protein fragment encoded by any of the nucleic acids with SEQ ID NO 40 to 91 and SEQ ID NO 115 to 276 or any subfragment of said vacA protein fragment, with said subfragment consisting of at least 5 contiguous amino acids of a vacA protein.
 20. A cagA protein fragment encoded by any of the nucleic acids with SEQ ID NO 92 to 114, or any subfragment of said cagA protein fragment, with said subfragment consisting of at least 5 contiguous amino acids of a cagA protein. 